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The invention is specifically directed to efficient, random, simple insertion of a 
transposon or derivative transposable element into DNA in vivo or /// vitro. The invention 

1? is particularly directed to mutations in A TP-utilizing regulatory transposition proteins that 
permit insertion with less target-site specificity than wild-type. The invention encompasses 
gain-of-funetion mutations in TnsC, an A TP-utilizing regulator) transposition protein that 
activ ates the bacterial transposon Tn7. Such mutations enable the insertion of a Tn7 
transposon or derivative transposable element in a non-specific manner into a given DNA 

20 segment. Insertion can be effected in plasmid and cosmid libraries. cDNA libraries. PCR 
products, bacterial artificial chromosomes, yeast artificial chromosomes, mammalian 
artificial chromosomes, genomic DNAs. and the like. Such insertion is useful in DNA 
sequencing methods, for genetic analysis by insertional mutagenesis, and alteration of gene 
expression by insertion of a given genetic sequence. 
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BACKGROUND OF THE INVEN TION 



1 




Description of the Background Art 

Transposable elements are discrete segments of DNA capable of mobilizing 
nonhomologously from one genetic location to another, that typically earn' sequence 
5 information important for two main functions that confer the ability to mobilize. They 
encode the proteins necessary to carry out the catalytic activity associated with 
transposition, and contain the cis-aeting sequences, located at the transposon termini, that 
act as substrates for these proteins. The same proteins can participate in the selection of the 
target site for insertion. 

10 The selection of a new insertion site is usually not a random process; instead, many 

transposons show characteristic prcfciences tor certain types of target sites. One broad 
characteristic that differentiates the wide variety of transposable elements know n is the 
nature of the target site selectivity ( 1 ). A component of this selectivity can be the target 
sequence itself. The bacterial transposon I n 10 preferentially selects a relatively highly 

15 conserved 9 bp motif as the predominant site for transposon insertion and less often selects 
other more distantly related sites in vivo (2). The Tel and Tc3 mariner elements of C. 
e/egans insert preferentially at a TA dinucleotide such that each end of the element is 
Hanked by a TA duplication (3) (4) (5). A lower specificity consensus sequence, N-Y-G C - 
R-N has been determined from populations of both in vivo and in vitro insertions for the 

20 bacteriophage Mu (7). In contrast to these elements, the bacterial transposon Tn5 exhibits 
markedly lower insertion site specificity, although some isolated "hotspots" have been 
detected (8). 

Another selection mechanism relies on structural features or presence of cellular 
protein complexes at the target sites. The yeast transposon T\ 3 preferentially inserts into 
25 the promoters of genes transcribed by RNA polymerase III. responding to signals from 
cellular proteins TFHIB and TITHC ( l >). 

l T nderstanding how these factors modulate transposase activity to impose target site 
preferences will lend insight into the spread of transposons and viruses, and may suggest 
ways to manipulate those target preferences. The bacterial transposon Tn7 is distinctive in 
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that it uses several element-encoded accessor) proteins to evaluate potential target DNAs 
tor positive and negative features, and to select a target site ( 1 >. Tn7 encodes five genes 
whose protein products mediate its transposition ( 10) ( 1 1 ). 

Two of the proteins. TnsA and TnsB. constitute the transposase activity. 
5 collaborating to execute the catalytic steps of strand breakage and joining (12). The activity 
of this transposase is modulated by the remaining proteins. TnsC, TnsD. and TnsH, and also 
by the nature of the target DNA. 

TnsC. TnsD. and TnsH interact with the target DNA to modulate the activity of the 
transposase via two distinct pathways. TnsABC + TnsD directs transposition to attTn", a 

10 discrete site on the E. coli chromosome, at a high frequency, and to other loosely related 
"pseudo att" sites at low frequency ( 13). I he alternative combination TnsABC + E directs 
transposition to many unrelated non-alt in" sites in the chromosome at low frequency (13) 
( 10) ( 1 1) and preferentially to conjugating plasmids (14). Thus, attTn~ and conjugable 
plasmids contain positive signals that recruit the transposon to these target DNAs. The 

15 alternative target site selection mechanisms enable Tn7 to inspect a variety of potential 
target sites in the cell and select those most likely to ensure its survival. 

The Tn7 transposition machinery can also recognize and avoid targets that are 
unfavorable for insertion. Tn7 transposition occurs only once into a given target molecule: 
repeated transposition events into the same target are specifically inhibited ( 15) ( 16). 

20 Therefore, a pre-existing copy of Tn7 in a potential target DNA generates a negative signal 
which renders that target "immune" to further insertion. The negative target signal affects 
both TnsD- and TnsK-activated transposition reactions and is dominant to any positive 
signals present on a potential target molecule ( 16). Sev eral other transposons. such as Mu 
and members of the Tn3 family, also display this form of negative target regulation ( 17) 

25 (18)(N)(7). 

Target selection could be an early or late event in the course of a transposition 
reaction. For example, a transposon could constitutively excise from its donor position, 
and the excised transposon could then be captured at different frequencies by different 
types of target molecules. Tnl 0 appears to follow this course of ev ents in vitro, excising 
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from its donor position before any interactions with target DNA occur (20) (21 ). 
Alternatively, the process of transposon excision could itself be dependent on the 
identification of a favorable target site. Tn7 transposition shows an early dependence on 
target DNA signals /// vitro: neither transposition intermediates nor insertion products are 

5 seen in the absence of an attTn" target (22). Thus, the nature of the target DNA appears to 
regulate the initiation of Tn7 transposition in vitro. 

An important question is how positive and negative target signals are 
communicated to the Tn7 transposase. Reconstitution of the TnsABC -r TnsD reaction in 
vitro has provided a useful tool for detailed dissection of Tn7 transposition (22) (23). This 

10 reaction has been instrumental in delineating the role of each of the individual proteins play 
in target site selection. Dissection of the TnsABOD reaction in vitro has implicated TnsC 
as a pivotal connector between the TnsAB transposase and the target DNA. TnsC is an 
ATP-dependent DNA-binding protein with no known sequence specificity (24). However. 
TnsC can respond to signals from attTn 7 via an interaction with the site-specific DNA- 

15 binding protein TnsD. In a standard in vitro transposition reaction TnsD is required for 
transposition to the attTn" site on a target DNA molecule. This site-specific insertion 
process is tightly regulated by TnsC. but does not occur in the absence of InsD. Additional 
evidence for a TnsC-TnsD interaction comes from DNA protection and band shift analysis 
with attTn" DNA (23). Direct interaction between TnsC and the TnsAB transposase has 

20 also recently been observed (25) (26). 

Therefore. TnsC may serve as a "connector" or ""matchmaker" between the 
transposase and the TnsD+</rt7/?~ target complex (23) (27). This connection is not 
constitutive, but instead appears to be regulated by the ATP state of TnsC. Only the A 1 P- 
bcuind form of TnsC is competent to interact with target DNAs and activate the TnsA-rB 

25 transposase; the ADP-bound form of TnsC has neither of these activ ities and cannot 

participate in Tn7 transposition (24) (23). TnsC 1 hydroly/es ATP at a modest rate (25). and 
therefore can sw itch from an active to an inactiv e state. The modulation of the ATP state 
of TnsC may be a central mechanism for regulating 1 n7 transposition. 
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The possibility that TnsC regulates the connection between the TnsA+B transposase 
and the target site prompted the inventor to predict that TnsC mutants can be isolated that 
would constitutively acti\ ate Tn7 transposition. 

TnsC therefore became an excellent candidate for mutagenesis, to search tor a gain 
5 of function protein capable of circumv enting the requirement tor targeting proteins. The 
inv entor therefore identified gain-of-function TnsC mutants which can activate the TnsA-B 
transposase in the absence of TnsD or Tnsli. They hav e characterized the ability of these 
mutants to promote insertions into various targets, and to respond to regulator}' signals on 
those targets. 

K) One class of TnsC mutants activates transposition in a way that is still sensitive to 

target signals, whereas a second class of 1 nsC mutants activates transposition in a way that 
appears to bypass target signals. As had been observed in vitro, the critical communication 
between the transposon and the target DNA appears to be an early event in the Tn7 reaction 
pathway in vivo, preceding the double-strand breaks at the transposon ends that initiate 

1 5 transposition. 

A particular mutant isolated from the random mutagenesis is TnsC A ^\ a mutant 
capable of an impressive activation of Tn7 transposition in the absence of TnsD (25). The 
single amino acid substitution made to generate TnsC A: ~ A has altered the protein such that 
it no longer requires an interaction with the target-associated TnsD. enabling it to activate 
20 transposition to a v ariety of target molecules very efficiently (25) (26). The inventor 

concluded that TnsC^"^ could promote transposition to target DNAs with low specificity 
based on results where transposition driven by the TnsABC A "^ machinery was directed to 
either F plasmids containing an atiTn' site. F plasmids lacking an uttTn" site, or the /;. coli 
chromosome with no apparent preference. 

25 

D\A Sequencing 

Sequencing DNA fragments cloned into vectors requires provision of priming sites 
at distributed locations w ithin the fragment of interest, if the fragment is larger than the 
sequence run length (amount of sequence that can be determined from a single sequencing 
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reaction). At present there are three commonly used methods of providing these priming 
sites: 

A) Design of a new primer from sequence determined in a previous run from 
vector-encoded primer or other previously determined primer (prime and run. primer 

5 walking) 

B) Random fragmentation and recloning of smaller pieces, followed by 
determination of the sequence of the smaller pieces from vector-encoded (universal) 
priming sites, followed by sequence assembly by ov erlap of sequence (random shotgun 
sequencing). 

l o C ) Deletion of variable amounts of the fragment of interest from an end 

adjacent to the vector, to bring undetermined fragment sequence close enough to the vector- 
encoded (universal) primer to allow sequence determination. 
All of these methods have disadvantages. 

Method A is time-consuming and expensiv e because of the delay involved in design 
15 of new primers and their cost. Moreover, if the fragment contains DNA repeats longer than 
the sequence run, it may be impossible to design a unique new primer; sequence runs made 
with primers within the repeat sequence will display two or more sequences that cannot be 
disentangled. 

Method B requires recloning; random fragmentation is difficult to achieve because 
20 fragments that are efficiently clonable (restriction enzyme digestion) do not have ends 

randomly distributed (Adams. M.D.. Fields. C. and Ventor, J. C. editors Automated DNA 
Sequencing and Analysis Academic Press 1994; Chapter 6. Bodenteich. K. et al.), and 
fragmentation methods that provide randomly distributed ends (shearing, sonication) do not 
provide DNA ends that are efficiently clonable (with 5' phosphate and 3' OI 1 moieties). 
25 Sequence assembly of is also difficult or impossible when two or more repetitive sequences 
longer than the sequence run are present in the starting fragment. 

Method C depends on prov iding randomly distributed end points for en/yniatically - 
determined deletions. There are many methods for making such deletions (especialh those 
involving exonuelease digestions, typically l:\onuclease 111), none of which provide 
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entirely random endpoints and which depend on the presence of unique suitable restriction 
enzyme sites at one or both ends of the cloned fragment. Howev er, because the deletion 
series in principle allows construction of a map (ot nested remaining fragment lengths in 
deletion derivatives) that is independent of the sequence itself, this method can allow 
5 repetitive sequence longer than the sequence run to be located within the fragment at 
appropriate locations. 

A method for introduction of universal priming sites at randomly distributed 
locations w ithin a fragment of interest is therefore a useful advance in sequencing 
technology. 

1 0 Transposition and the sequencing problem. 

Previous efforts have been made to provide distributed priming sites by means of 
transposable elements. These methods have fallen short of this goal in three ways: first, 
the transposable elements have not provided a sufficiently random distribution of priming 
sites; second, the transposition method (carrying out transposition in vivo, followed by 
15 recovery of the targeted DNA and repurification) has been time-consuming and laborious; 
third, the Systems have been prone to produce undesired products. These undesired 
products include but are not limited to: a) cointegrates (replicon fusions) between the donor 
of the transposon and the target plasmid; b) insertions in w hich the two ends of the 
transposon act at different positions (leading to deletion of the intervening target); c) 
20 insertions of multiple copies of the transposon into the target, so that priming from one end 
of the transposon yields two superimposed sequences. The method has been laborious in 
two ways: the majority of insertions have been into chromosomal DNA of the host, and 
even for those insertions into the plasmid the recover} method has entailed loss of 
independence of insertions, in vitro methods of insertion have suffered from both the non- 
25 random location of insertion sites and the undesired products, and also from poor 

efficiency, so that it has been impractical to obtain large numbers of insertions into the 
target of interest w ithout excessiv e labor. 

Increasing interest in large scale sequencing projects and a concomitant search for 
highly efficient in vitro mutagenesis methods has promoted the adaptation of several in 
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vitro transposon systems as tools to study genomes. An in vivo reaction tor the bacterial 
transposon Tn3 has been used to efficiently sequence plasmid inserts of variable lengths; 
how ev er, only approximately 37% of the nucleotides were found to be capable of serving as 
sites for insertion (Davies. 1995 #419). A similar, more random system has been 
5 dev eloped for yeast retrotransposon Tyl, employing synthetic transposons with IJ3 ends as 
substrates and Tyl virus-like particles supplying transposition functions (28) to sequence 
plasmids with yeast and human DNA inserts. A disadvantage to this method is the 
requirement for the cumbersome preparation of VLPs. In vitro transposition with an MLV 
integrase system has been utilized as a tool to dissect some of the mysteries of chromatin 

10 packaging (29) (30) (3 1 ) and as a tool for functional genetic footprinting (32). However, 
the MLV inserti ons do no i appear to be completely random. An object of the invention 
therefore is to provide a transposon and transposition reaction with more random target site 
specificity. Therefore, the inventor examined the target site selectivity of the TnsC A22rA 
machinery in vitro and explored the viability of this reaction as an effective tool for random 

15 insertional mutagenesis. 



BRIEF SUMMARY OF THE INVENTION 
Accordingly, a general object of the invention is to provide a transposable system 
20 that achieves efficient, simple, non-specific or random insertion into any given DNA 
segment. 

A further object of the invention is to provide a transposable system that achiev es 
efficient random insertional mutagenesis via simple insertion. 

Therefore, a specific object of the invention is to provide a transposable system that 
25 achieves efficient target site specificity that is reduced from wild-type and preferably 
random, via simple insertion. 

A more particular object of the invention is to provide a transposon containing a 
mutation in a transposon-derived protein that allows efficient, simple insertion and target 
site selectivity that is reduced from the wild-type, and preferably random. 
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A more particular object of the invention is to provide a transposable system with a 
mutation in a transposon-derived A I P-utilizing regulatory protein. The mutation allows 
the ef ficient, simple, non-specific or random insertion of the transposable element into a 
DNA segment or at least provides reduced target site specificity from the wild-type. 
5 A preferred object of the inv ention is to provide a Tn7 transposable system that 

achieves simple, efficient, non-specific or random insertion into a given DNA segment, or 
at least reduced target site specificity compared to the wild-type Tn7. 

.A preferred object of the invention is to provide a mutation in the Tn7 transposon 
that confers efficient, simple, non-specific insertion into a given DNA segment, or at least 
10 reduced target site specificity compared to the wild-type Tn7. 

A preferred object of the inv ention is to provide a Tn7 transposable system with a 
mutation in the TnsC protein encoded in the Tn7 transposon. which mutation allows 
efficient, simple insertion with reduced target site specificity compared to the wild-type, 
and preferably allows non-specific insertion into a DNA segment. 
15 Objects of the invention include methods for using the above compositions. 

Accordingly, a general object of the invention is to provide a method for efficient, 
simple, random insertion of a transposable element into a given DNA segment. 

A further object of the invention is to provide a method for efficient, simple, 
random insertional mutagenesis by a transposable element. 
20 A specific object of the invention is to provide a method for efficient, simple, 

random transposition of a transposable element into a DNA segment, or in which the 
specificity of transposition is reduced compared to wild-type. 

A more particular object of the invention is to provide a method for efficient, 
simple, random transposition of a transposable element into a DNA segment in which the 
25 specificity of transposition is reduced compared to the wild-type by using a transposable 

system containing a mutation that confers efficient, simple insertion with reduced target site 
specificity compared to the wild-type, and preferably random insertion. 

A more particular object of the invention is to prov ide a method for efficient, 
simple, random transposition of a transposable element into a DNA segment or in which 
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the specificity of transposition is reduced compared to wild-type, by using a transposable 
system with a mutation in an ATP-utili/ing regulatory protein, the mutation allow ing the 
efficient, simple, non-specific insertion of the transposable element into a DNA segment or 
at least providing for reduced target site specificity compared to the wild-type. 
5 A preferred object of the invention is to provide a method for efficient, simple 

transposition of a transposable element into a DNA segment in which the specificity of 
transposition is reduced compared to wild-type, or is preferably random, by providing a 
Tn7 transposable system that is capable of non-specific insertion into a DNA segment, or at 
least reduced target site specificity compared to the wild-type Tn7. 
10 A further object of the invention is to provide a method for efficient, simple 

transposition of a transposable clement transposon into a DNA segment in which 
specificity of transposition is reduced compared to wild-type or is preferably random by 
providing a Tn7 mutation that confers the efficiency, ability to make a simple insertion, and 
the randomness or reduced specificity. 
15 A further object of the invention is to provide a method for efficient, simple, 

random transposition of a transposable element into a DNA segment, or in which the 
specificity of transposition is reduced compared to the wild-type, by providing a mutation 
in the TnsC protein encoded in the Tn7 transposon. the mutation allowing a reduction in 
target site specificity compared to the wild-type and preferably allowing non-specific or 
20 random insertion of the Tn7 transposable element into a DNA segment. 

A further object of the invention is to provide a method for DNA sequencing using 
a transposable system to introduce priming sites at randomly-distributed locations within a 
fragment of interest where the fragment is larger than the sequence run length. 

A preferred object of the inv ention is to provide a method for DNA sequencing 
25 using a transposable system with a mutation that allows efficient and simple insertion and 
target site selectivity that is reduced from the wild-type and preferably random. 

A preferred object of the inv ention is to prov ide a mutation in an ATP-utili/ing 
regulatory protein. The mutation allows the efficient, simple, non-specific insertion of the 
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transposon into a DNA segment or at least provides reduced target site specificity over 
wild-type. 

A highly preferred object of the invention is to provide a method for DNA 
sequencing using a Tn7 transposahle system that allows efficient, simple, non-specific 
5 insertion into a DNA segment or at least reduced target site specificity compared to the 
wild-type Tn7. 

A highly preferred object of the invention is to provide a method for DNA 
sequencing using a Tn7 transposahle system with a mutation in the TnsC protein, the 
mutation allowing efficient, simple insertion and a reduction in target site specificity 

10 compared to the wild-type and preferably allowing non-specific or random insertion of the 
Tn7 transposahle element into me DNA segment. 

A further object of the invention is to provide methods as described above that can 
be applied to any given DNA segment. These include, but are not limited to. plasmids, 
cellular genomes, including prokaryotic and eukaryotic, bacterial artificial chromosomes. 

15 yeast artificial chromosomes, and mammalian artificial chromosomes, and subsegments of 
any of these. 

An object of the invention is to provide these methods in vitro or in vivo. 

A further object of the invention is to prov ide kits for carrying out the above- 
described methods using the above-described transposons or parts thereof. 
20 The inventor has accordingly developed a transposahle system and methods that 

improve on in vitro and in vivo transmission methods previously described in that the 
methods are efficient for transposition, provide relatively random insertion, and almost all 
products recovered are simple insertions at a single site which thus provide useful 
information. 

25 In a general embodiment of the invention, the invention is directed to a transposahle 

system that achieves simple, efficient, random insertion into a given DNA segment. 

In a further embodiment of the invention, the invention is directed to a transposahle 
system that is capable of efficient random insertional mutagenesis, preferably b> means of a 
simple insertion. 

1 1 
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In a specific embodiment of the invention, the invention is directed to a 
transposable system with target site specificity that is reduced from the wild-type and 
preferably random, which allows simple and efficient insertion. 

In a further specific embodiment of the invention, the invention is directed to a 
5 transposable system containing a mutation that allows target site specificity that is reduced 
from the wild-type and is preferably random. 

In a preferred embodiment of the invention, the invention is directed to a 
transposable system with a mutation in an A TP-utilizing regulator) protein, the mutation 
allow ing the ef ficient, simple, non-specific insertion of the transposon into a DNA segment 
10 or at least providing reduced target site specificity from the wild-type. 

In a highly preferred embodiment of the invention, the invention is directed to a Tn7 
transposable system that achieves efficient, simple, non-specific insertion into a given DNA 
segment, or at least reduced target site specificity compared to the wild-type Tn7. 

In a highly preferred embodiment of the invention, the invention is directed to a 
15 mutation in a Tn7 transposon that confers the capability of efficient, simple, non-specific 
insertion into a DNA segment, or at least reduced target site specif city compared to the 
wild-type Tn7. 

In a highly preferred embodiment of the invention, the invention is directed to a 
mutation in the TnsC protein encoded in the Tn7 transposon. the mutation allowing simple. 
20 efficient insertion and a reduction in target site specificity compared to the wild-type and 
preferably allow ing non-specific or random insertion of the Tn7 transposition into a DNA 
segment. 

In a specific disclosed embodiment of the invention, the invention is directed to a 

\22 <1 V 

Tn7 mutant designated TnsC ' . which is a mutant having an alanine to valine 
25 substitution at amino acid number 225 in the TnsC gene. 

The invention also embodies methods for using all of the above compositions. 
Methods are directed to transposition or insertion of the transposable elements described 
above. 
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Accordingly, in one embodiment, the invention provides generally for efficient, 
simple, random insertion of a transposon into a given DNA segment, or at least insertion 
with reduced specificity compared to the wild-type. 

In a further embodiment of the invention, the invention is directed to methods for 
5 insertional mutagenesis using a transposable system that is capable of efficient, simple, 
random insertion or at least insertion with reduced specificity compared to wild-type. 

In a further embodiment of the invention, the invention is directed to methods for 
insertion of a transposable element into a DNA segment in which target site specificity is 
reduced from wild-type and is preferably random, where insertion is efficient and simple. 
10 In a further embodiment of the invention, the invention is directed to methods for 

insertion of a transposable element into a DNA segment, by providing a transposable 
element containing a mutation that allows efficient and simple insertion and target site 
specificity that is reduced from the wild-type and is preferably random. 

In a preferred embodiment of the invention, the invention is directed to methods for 
15 inserting a transposable element into a DNA segment by providing a transposable system 
with a mutation in an A I P-utilizing regulatory protein, the mutation allowing simple, 
efficient, and non-specific insertion of the transposon into a DNA segment, or at least 
providing reduced target site specificity from the wild-type. 

In a highly preferred embodiment of the invention, the inv ention is directed to 
20 methods for inserting a transposable element into a DNA segment by providing a Tn7 
transposable system allowing efficient, simple, non-specific insertion into a given DNA 
segment or at least reduced target site specificity compared to the wild-type Tn7. 

In a highly preferred embodiment of the invention, the invention is directed to a Tn7 
transposable system with a mutation that allows simple, efficient, and non-specific 
25 insertion of a transposable element into a DNA segment or at least provides reduced target 
site specificity from the wild-type Tn7. 

In a highly preferred embodiment of the inv ention, the inv ention is directed to 
methods for inserting a transposable element into a DNA segment by providing a Tn7 
transposable system with a mutation in the TnsC protein, the mutation allowing efficient 
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and simple insertion and a reduction in target site specificity compared to the wild-type and 
preferably allowing non-specific or random insertion of the Tn7 transposition into a DNA 
segment. 

In a specific disclosed embodiment of the inv ention, the invention is directed to 
5 methods for inserting a transposable element into a DNA segment, by providing the Tn7 
mutant I nsC ~" . 

The invention also provides kits for performing the above-described methods and 
the methods further described herein. In a preferred embodiment, a kit is supplied whose 
components comprise a mutant ATP-utilizing regulator}' protein derived from a transposon, 
10 the mutation allowing efficient, simple, non-specific insertion of the transposon into a 

given DNA segment. The kit also provides a transposable element which can be found as 
part of a larger DNA segment; for example, a donor plasmid. The kit can further comprise 
a buffer compatible w ith insertion of the transposable element. The kit can further 
comprise a control target sequence, such as a control target plasmid. for determining that all 
15 of the ingredients are functioning properly. For DNA sequencing, the kit can further 
comprise sequencing extension primers with homology to one or more sites in the 
transposable element. Primers can have homology to sequences outside the transposable 
element (i.e. in a target vehicle). 

In the kits, the mutant protein may be added as a purified protein product, may be 
20 encoded in the transposable element and produced therefrom, or encoded on vectors 
separate from the transposable segment, to be produced in vivo. 

It is to be understood that the invention encompasses transposable systems with 
varying degrees of reduction of target site specificity from the wild-type w hich are useful 
for the purposes of the invention described herein. 
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BRIFT 1)1 SC RIP HON OF THK FIGCRFS 

Figure 1 . Papillation phenotypes of the TnsC gain-of-funetion mutants. Cells were patched 
on MaeConkey lactose plates and photographed alter three days' incubation at 30 C. 
5 TnsA+B was present in each strain; the TnsC species present is indicated below each patch. 

Figure 2. Amino acid changes in the TnsC mutants. The TnsC protein sequence (SHQ ID 
NOS.l and 2) is cartooned, with the residues altered in the Class I mutants indicated above 
the protein and the Class II mutants below the protein. Matched boxes represent Walker A 
10 and Walker B motifs. 

Figure 3. TnsC mutants promote transposition to the chromosome. Frequencies of 
transposition of miniTn7-Km R from a ). phage to the chromosome were measured by the K 
hop assay. TnsA+B was present in each strain; the TnsC species present is indicated below 
15 each column. 

Figure 4. TnsC mutants promote transposition to conjugable plasmids. Frequencies of 
transposition of miniTn7-Km R from the chromosome to the conjugable target plasmid 
p()X-Ci were measured by the mating-out assay. TnsA+B was present in each strain; the 
2() TnsC species present is indicated below each column. 

Figure 5. The substrates, intermediates and products of Tn7 transposition. One substrate is 
a donor plasmid containing a mini Tn7 element which contains the essential cis-acting 
sequences at each end for transposition. The other substrate is a target plasmid. 
25 Transposition initiates w ith a double strand break at either end of the element, followed by 
a second break at the other end to generate an excised linear transposon. This excised 
transposon is then joined to the target I)N.\ to form a simple insertion. 
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Figure 6. Analysis of Tn7 transposition reactions on a agarose gel. The donor plasmid. a 
pBR derivative, contained a mini Tn7 element containing a kanamycin gene and the target 
plasmid contained an attTn~ site. Recombination reactions were carried out as described, 
the DNAs isolated from the reaction mixture by phenol extraction, digested with a 
5 restriction enzyme that cuts once in the donor backbones, displayed by electrophoresis on 
an agarose gel, transferred to a membrane by electrotransfer and hybridized w ith a probe 
specific for the miniTn7 element. Lane 1: TnsA-B: Lane 2: TnsA-B - Cut; Lane 3: 
TnsA-B - CH233K: Lane 4: TnsA+B + CS 40IYA402 . Lane 5: Tns(A+B)f C A225V 

10 Figure 7. Tn7 insertion mediated by TnsA+B + c A22 * v occurs at many different sites in a 
target DNA. In vitro transposition reactions using TnsA+B + C A22:A were carried out and 
the DNAs isolated by phenol extraction and ethanol precipitation. A PCR reaction using 
the transposition products as a template was then carried out in which one primer (NLC 
209) (SLQ ID NO:8) complementary to a sequence on the target DNA and another primer 

15 NLC 95 (SLQ ID NO:7) complementary to the left end of Tn7. The length of the PCR 
products will vary depending on the position of the Tn7 insertion, for example, insertions 
being closer to the target primer will be short (insert 1 ) and those more distant will be 
longer (insert 2). The products of the in vitro reaction were then displayed on a denaturing 
acrylamide gel by electrophoresis, transferred from the gel to membranes and analyzed by 

20 hybridization to a radioactively labeled probe that hybridizes to Tn7 sequences on one end 
of the transposon. 

Figure 8. Analysis of distribution of insertions in different regions of the plasmid. Tn7 
displays little target site selectivity at many regions of a target. In vitro transposition 
25 reactions were carried out and the products used as a template for PCR reactions as 

described above except for the target primer. In these experiments, one primer in the end 
of Tn7 (NLC 95) (SLQ ID N():7) was used and in separate reactions primers from several 
different positions in the target DNA were used. 
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Figure 9A-C. Structure of Tn7 donor plasmids. A. A plasmid contains a mini Tn7 element 
in which the essential cis-acting sequences at the element termini tlank a selectable marker. 
The translocation of the element can be readily follow ed by hybridization to a mini I n 
specific probe. Many different kinds of information could be inside the ends as a selectable 
5 (or identifable marker, for example, an antibiotic resistance gene. If the products of 

transformation are to be recovered in vivo, it is convenient to remove unreacted donor DNA 
by digestion with a restriction enzyme that is selective for the donor backbone; alternatively 
a conditional replicon can be used. B. Sequence of Donor plasmid pKM delta R.adj to 1 
(SHQ ID NO:3). Plasmid carries a 1625 bp mini-Tn7 element: 199 bp of Tn7R and 166 bp 
10 of Tn7L Hank a Kan gene with Sail sites at the junctions. The backbone is pTRCOO 
(Pharmacia); mini-Tn7 plus flanking host DNA was cloned into the Smal site. C. A 
commonly used derivative is pHM-A, (SHQ ID NO:4) a pBR plasmid containing a 
kanamycin mTn7 element. 

15 Figure 10A-B. Tn7 target plasmids. A. Sequence of Tanzet plasmid pHR 183 (SHQ ID 
N():5) . This 8.9 kb pACYC184 derivative carries chloramphenicol resistance, a pI5A 
origin of replication, and inserts carrying mcrB, mcrC, hsclS. and a segment of phage f\. A 
large target was used to detect preference of moderate complexity (up to four bp 
preferences should be detectable). In addition, different segments of the plasmid vary in 

20 G+C content from 35% to 68%. so that any preference the transposition system might 

display for a particular G+C content might be revealed. B. The major targets used in this 
work are pRM2 (SHQ ID N():6). a 3190 bp pBR derivative containing at uttTnZ segment 
and pHR183 (SHQ ID \():5). a pACYC derivative containing several /;. coli genes. 

25 Figure 11. Diagram of sequencing runs used to ascertain the positions of 63 insertions of 
mini-Tn7 into pHR 1 83 [SHQ ID N():5). Numbers at the top refer to coordinates on the 
sequence of pHR 183 (SHQ ID NO:5) displayed in fig. 1 IB. Arrows indicate the direction 
of primer extension: arrow stems cover the sequence obtained from the run. Arbitral*} 
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numbers attached to the arrows assigned by the sequence assembly program 
Al TOASSHMBLH. 

Figure 12. Graph of the observed distribution of insertions in 100-bp intervals of pHRl 83 
5 (SEP ID XO:5). and the distribution expected if the distribution were random. On the 
abcissa is the number of insertions per interval: on the ordinate is the number of intervals 
that exhibit that number of insertions. Grosses show the expected values for a random 
(Poisson) distribution of insertions along the sequence; diamonds show the observed 
values. 

10 

Figure 13. The base composition of the 5bp sequences duplicated by the process of Tn7 
insertion for the 63 sites examined. On the abcissa. sequence positions are numbered 
relative to the right end of Tn7 (Tn7R) such that position 1 is immediately adjacent, 
position 5 is 5bp away (see diagram below the graph). On the ordinate is the number of 
15 instances of a particular base at that position. All bases are well represented at all sites. 

figure 14. H fleet of four methods of stopping the transposition reaction in preparation for 
introduction into cells . Results for four replicates (abcissa) of each of four stop methods (/ 
axis), reported as number of transformants per 1 /50th of the total reaction (ordinate). 
20 Treatments were: no treatment; heat treatment at 65°C for 20 min; heat treatment at 75°C 
for 10 min; and phenol extraction followed by ethanol precipitation. Heat treatment at 
75°C but not 65°G allows effective recovery. 

Figure 15. A second experiment displaying the effect of three methods of sto pping the 
25 transposition reaction in prep aration for introduction into cells . Results are shown for two 
replicates of each of three stop methods (abcissa) for four doses of tw o different aliquots of 
TnsB (z axis), reported as number of transformants per 1 25th reaction. Treatments w ere: 
heat treatment at 75°G for 10 min: ethanol precipitation alone; or heat treatment at 65°G for 
20 min. On the / axis, two aliquots ( 1- or 2-) of TnsB were used, in four different doses. 
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l|il 1.5|il 2jul or 3jil. The row labeled 1-2, for example, employed aliquot 1 and used 2|il 
of it. Heat treatment at 75°C but not 65°C allows effective recov ery. This experiment also 
illustrates the dose-response to TnsB. 

5 Figure 16. Effect of tw o methods of storing proteins on the efficiency of the transposition 
reaction . Abcissa displays the storage conditions tested: "individually", TnsA. TnsB and 
TnsC proteins stored individually in separate tubes at -70°C; "as a mixture (A2af\ TnsA, 
TnsB and TnsC proteins stored together as a mixture at -70°C. Ordinate displays the 
number of transformants per 1 50th of the total reaction. Each treatment w as tested in 
10 quadruplicate. 

Figure 17. Effect of three methods of storing proteins on the efficiency of the transposition 
reaction . Abcissa displays the storage conditions tested: "individually", TnsA, TnsB and 
TnsC proteins stored individually in separate tubes at -70°C; "as a mixture, -70 (A2af\ 
\5 TnsA, TnsB, and TnsC proteins stored together as a mixture at -70°C "as a mixture, -20 
(A2bf \ TnsA, TnsB and TnsC proteins stored together as a mixture at -20°C. Ordinate 
displays the number of transformants per 1 50th of the total reaction. Each treatment was 
tested in quadruplicate. 

20 Figures 18A-1 SB. ISA. nucleotide sequence of TnsC (SEQ ID NO:l ). 18B. Amino acid 
sequence of TnsC (SEQ ID NOS:l and 2). 
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DETAILED DESCRIPTION OF THE INVENTION 

In the art. the term "transposon" encompasses a segment Hanked by particular cis- 
acting sites that are required for mobilization to occur, together with the genes that specify 
5 the proteins that act on those cis-acting sites to mobilize the segment defined by them, 
whether or not the protein-encoding genes lie between the sites mentioned. I or example, 
according to the present invention, a Tn7 transposon can correspond to the wild type 
transposon except that the transposon encodes a mutant TnsC. This transposon thus 
provides the protein products required for mobilization. However, an entire transposon is 
10 not necessary to practice the invention. Thus, the term ""transposon deriv ative*'. 

"transnosable element", or insertable element" as used herein can also refer to DNA 
minimally comprising the cis-acting sites at which the trans-acting proteins act to mobilize 
the segment defined by the sites. It is also understood that the sites may contain 
intervening DNA. 

15 The phrase "transposablc system" as used herein encompasses a transposon 

containing a mutation in a native A TP-utilizing regulatory protein which, when expressed 
from the transposon. allows for the non-specific target site selectivity or reduced target site 
selectivity disclosed herein. The phrase also encompasses modifications in which the 
relevant proteins are not encoded on the transposablc element but nevertheless, acts upon it 

20 to achiev e the objects of the invention. Thus, the system encompasses compositions in 
which the mutant protein is added to a transposablc element that is derived from a 
transposon but where the element contains less than the full complement of genes. The 
only limitation on this element is that it contain the cis-acting sequences upon which the 
mutant protein acts that allows integration of the element into a target DNA. I hus, the 

25 system comprises DNA with cis-acting sites (which may contain heterologous DNA 

sequences) and the trans-acting proteins that employ those sites to mobilize the segment 
defined by the sites, regardless of how they are organized in DNA. Accordingly, the 
proteins may be provided in separate plasmids or in purified form. 
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The term "transposon-derived" as used herein to refer to the mutant protein, refers 
to a derivativ e of a protein normally found on the transposon. I lowever. this need not be 
the naturally occurring protein but can be the protein produced by recombinant or chemical 
synthetic methods known to those in the art. 
5 The term "transposable element" encompasses both transposons and derivatives 

thereof. The only limitation on the derivative is that it is capable of integrating into DNA, 
containing cis-acting sequences that interact with transacting proteins to effect integration 
of the element. 

The invention provides a transposable system that allow s simple integration of a 
10 transposable element into a given DNA target efficiently and with a relatively lovv degree of 
specificity, preferably random specificity. By "relatively" is intended the degree of 
specificity compared to the wild-type. 

The efficiency of integration can van depending upon the particular use for which 
insertion is desired. The mutations described herein increase the efficiency of integration 
15 compared to the w ild-type frequency. The invention encompasses an efficiency of one 
simple integration event per every 5-10 kilobases. Preferred levels of integration allow 
multiple simple insertions in different positions in every gene. 

Integration is also effected by the degree of specificity that the mutation confers or 
allows. Thus, specificity relates to the relationship of a target DNA sequence and the 
20 transposable system. 

A preferred degree of specificity results in an av erage insertion in even gene. A 
practical lower limit would be. on average, one insertion per twenty genes. 

For sequencing, greater than or equal to c )0° o of the insertions screened are at 
different locations (i.e. 10 insertions hit at least 9 different sites) so that almost even 
25 template examined gives new information. This is true in DNAs of a v ariety ot different 
base compositions since possible target DNAs may van between 20° o and 80° o (i C. 
Another way to describe the possible randomness of the system is to say that of 63 
insertions. 62 insertion sites were found (around c )8°o of insertions are at different 
locations). 
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For mutagenesis, non-commercial systems have been widely used that yield as little 
as 10% of insertions at different sites (i.e. c > of 10 insertions are at the same site). 1 he 
present invention improv es on this level of randomness. 



5 Furthermore, the types of insertions that are relevant to the discussion of frequency 

are simple insertions. 

The invention provides a transposable system with a mutation that provides tor 
efficient, simple insertion and reduced or random target site specificity. 

The term "simple insertion'* refers to a single copy integration event of the element 
10 introduced into the target by double-strand breakage and rejoining. 

Although simple insertions (only one copy of the integrant) are preferred, there may- 
be certain embodiments in which more than one copy does not interfere with the purpose of 
the application, for example some applications of//? vitro mutagenesis, or is actually 
desirable (for example, for multiple copies of a heterologous DNA sequence are to be 
15 inserted). Accordingly, the invention is not limited to the case in which the transposable 
system provides for simple insertion only. 

In a preferred embodiment of the invention the mutation is in a transposon-derived 
A TP-utilizing regulatory protein. One can recognize such a protein by its similarity to the 
TnsC protein of Tn7. that is by its sequence homology, its possession of a protein sequence 
20 motif element similar to an A I P binding site motif in other ATP-dependent proteins, or by 
reconstitution of an //; vitro transposition system and demonstration of a requirement for 
nucleotides in that /// vitro transposition system. 

In a highly preferred embodiment, the mutation is in the TnsC gene (SHQ II) NO: 1 ) 
encoding the TnsC protein (SHQ II) NOS:I and 2) of Tn7. This mutation provides a Tn7 
25 transposon that is capable of relatively non-specific insertion into a given DNA segment. 

Thus, the invention is directed to insertion of the Tn7 transposable element but is 
not limited to this transposable element. Accordingly, the invention can be practiced with 
transposable elements related to Tn7 in that transposition occurs by means of an A 1 P- 
mediated process. finis, mutations in the A I P-utilizing proteins in such transposons is 
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contemplated in this disclosure. Accordingly, transposons with A I P-utilizing regulatory 
proteins in addition to Tn7 are encompassed in the invention. Examples of such 
transposons are Tn5()90 Tn420: the transposon-encoded transposition proteins are TniA. 
TniB. and TniQ. The TniB would he the ATP-utilizing protein. 
5 Another class of transposon is encompassed by the invention in which it is possible 

to increase the frequency by altering the ATP-utilizing proteins examples are Tn552 and 
IS21. 

The invention provides for the insertion of the transposable elements described 
herein into any DNA segment of any organism. Moreover, the invention also provides for 

10 the insertion into any synthetic DNA segment. 

Insertion of the transposable element can be in vivo. In this case, the transposable 
element is introduced into a desired host cell, where it inserts directly into DNA in that cell. 
The only limitation is that the transposable element be capable of insertion in the specific 
host cell DNA. Thus, as long as the proteins required for transposition can be expressed in 

15 a desired cell, this cell can provide a host for insertion of the transposable element into any 
DNA found in that host cell. 

Insertion can be for the purpose of gene inactivation. Gene inactivation is useful for 
genetic analysis (e.g. gene function). 
Genetic analysis includes: 

20 assessment of the phenotype of a null allele (not expressing functional protein due to 

interruption of the gene by the transposable segment); assessment of the consequences of 
insertion of particular active DNA structures or sequences for genetic properties of 
chromosomes or their parts, such as but not limited to accessibility to Dnase I or to 
footprinting reagents, or expression or silencing of nearby transcribable genes, or for 

25 activity of genetic or epigenetic processes such as. but not limited to homologous 
recombination, chemical mutagenesis, oxidative DNA damages. DNA methylation. 
insertion of proviruses or retroposons: assessment of protein domain structure via creation 
of multiple interruption points within a gene for a multidomain protein, wherein a gene 
product missing one or more domains of the multidomain protein might exhibit partial 

Rl \nl :i(>S5!*M \tt\ Dkt So 35"S^ 24 1 S25 i ^Xw. ; \ , 




activity or activities, including antigenic activities or immunodominant epitopes 
| randomness is paramount here, many insertion positions are needed if borders are to be 
defined accurately); assessment of expression pattern via creation of transcriptional fusions 
of a promoter in the target to a reporter ( e.g. beta galactosidase or green tlourescent protein 

5 or chloramphenicol transacetylase or luciferase) within the transposable segment; 

assessment of expression pattern via creation of translational fusions of a portion of a gene 
product encoded by a target to a gene product or an antigenic peptide encoded by the 
transposable segment (e.g. beta galactosidase or an epitope tag or an affinity tag); 
assessment of operon structure, in which interruption of transcription by insertion upstream 

10 of a gene results in altered expression of a gene without disrupting the coding sequence of 
that gene; gratuitous expression of a gene, in which transcription from a promoter within 
the transposable segment results in expression of a gene downstream of the position of 
insertion of the transposable segment, with or without regulation of transcription of the 
promoter w ithin the transposable segment; gratuitous expression of a protein fusion, in 

\5 which transcription from a promoter within the transposable segment results in translation 
of a protein beginning within the transposable segment and proceeding toward the outside 
of the transposon, then continuing into the gene w ithin w hich the transposable segment is 
inserted, resulting in a fusion of the transposon-encoded protein w ith the target protein; 
assessment of the consequences of introducing into the host cell any transcript or gene 

20 product entirely encoded within the transposable segment, especially where it is desirable 
to assess position-effects (the consequences not only of expression but of expression in 
different positions within the genome). 

Insertion can also be for the purpose of introducing heterologous ON A sequences 
into the DNA of a host cell. The DNA in the host cell in which the insertion occurs can be 

25 the host genomic DNA or extrachromosomal elements. This includes both naturally- 
occurring elements and elements introduced exogenous]) . 

Heterologous genes that can be introduced via the insertion include reporter genes. 
DNA sequences can also be introduced that provide physical markers in a chromosome. 
Insertion can also be used as a simple way to recover the host DNA that is Hanking the 
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inserted element. Genomic DNA is cut with restriction enzymes and the insertion plus the 
flanking DNA is then cloned. 

Another utility or another application of the invention is to analyze the interaction 
of various non-transposition proteins with a DNA sequence, for example. DNase 
5 footprinting of repressors bound to DNA. A further use is to study the structure of genomic 
chromatin i.e.. the state at which DNA is actually found in the cell. 

A further advantage in using Tn7 and similar transposons is that of double end or 
"concerted" joining. Accordingly, Tn7 inserts in a "cut and paste" manner with both ends 
of the transposon being joined to the target DNA. 
10 Insertion can also be in vitro. In vitro insertion provides an advantage over insertion 

in vivo. Using in vitro insertion, the transposable element can be placed in any DNA target 
and that target then introduced into a host cell where it can integrate or replicate. 
Accordingly, this greatly expands the host cell range. 

Targets for insertion, accordingly, include DNA fragments, plasmids and other 
15 extrachromosomal elements capable of replication in prokaryotic and/or eukaryotie host 
cells. Given the array of plasmids available, potentially any cell can be used as a host for 
an insertion target containing a transposable element that was introduced into the target in 
vitro. The target can be based on a bacterial plasmid, bacteriophage, plant virus, retrovirus, 
DNA virus, autonomously replicating extra chromosomal DNA element, linear plasmid. 
20 mitochondrial or other organelle DNA. chromosomal DNA. and the like. 

When introduced into the host cell, the target can be maintained as an autonomously 
replicating sequence or extrachromosomal element or can be integrated into host DNA. 
When integrated, integration can occur by homologous recombination or by means of 
specific integration sequences such as those derived from retroviruses. DNA viruses, and 
25 the like. 

It may be. but is not necessarily, desirable to obtain replication of the target in the 
host cell. A specific application in which this is desirable is the case in which a 
transposable element is used as a component for introducing primer binding sites for DNA 
sequencing. 
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Accordingly, in a highly preferred embodiment of the invention, a transposable 
element is introduced into a target containing a DNA segment for which a sequence is 
desired. This target is then introduced into a host cell where it is allowed to replicate, thus 
producing sufficient copies to allow DNA sequencing using a primer specifically 
5 recognizing a sequence in the target. 

In one embodiment of this method, the primers recognize one or both ends of the 
transposable element such that sequencing can proceed bidirectional ly from the 
transposable element insertion site into the surrounding DNA. The target may be 
composed entirely of DNA segments for which the sequence is required or may simply 
10 contain subsequences for which a sequence is required. In this aspect the only limitation on 
the target is that it is able to replicate in the host cell (and therefore contains sequences that 
allow this to occur). 

It is also highly desirable that the target have a selection marker in order to 
eliminate the background in host cells containing the target w ithout the insertion of the 
15 transposon. 

An alternative way to eliminate this background, however, is to provide a method 
for disabling a target that has not received an insertion so that it is unable to replicate in the 
host cell and is thus diluted out during host cell culture. Accordingly, the transposable 
element itself could contain an origin of replication for the host cell. Thus targets not 

20 receiving an insertion would be unable to replicate. An insertion could also result in the 

formation of functional replication sequences. The target could also contain a heterologous 
conditional origin, such as the R6K origin, that cannot replicate without the pir protein. 
The person of ordinary skill in these arts would be aw are of the various methods for 
constructing targets with the (in)ability to replicate in a specific host cell. 

25 It is also possible, however, to use the transposable elements described herein for 

DNA sequencing without the /// vitro insertion described above. Insertion could be directly 
accomplished in host cell DNA and then the DNA containing the insertion removed from 
the host. This DNA segment could then be replicated although it does not necessarily have 
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to be if the host has produced sufficient copies for sequencing. Accordingly, sufficient 
numbers of the segment w ith the insertion sequence could then be sequenced as above. 

An example of the case in w hich the DNA segment receiving an in vivo insertion 
would not need to be further replicated in another host is. for example, a case in which the 
5 insertion occurs in a sequence capable of being amplified directly in the host ceil. This 
could be a plasmid containing an amplifiable marker, such as the dhfr gene, the cell being 
grown in a selective medium containing methotrexate. The person of ordinary skill in the 
art would know the various methods for amplifying DNA segments using selectable 
markers. The selectable marker could be introduced on the transposon but would not 

10 necessarily need to be. 

In a further DNA sequencing protocol, the primers that are used facilitate DNA 
segment amplification by the PCR reaction. For example, a primer can be used that 
recognizes an end of the transposable element w ith the second primer being found in the 
target DNA sequence. The primer could be based on random sequences or on known 

15 sequences deliberately placed in the target vehicle. Thus the target vehicle could contain a 
characterized plasmid (as an example) in which the sequences are known. In this instance, 
primers can be designed to hybridize to any area within the plasmid. the segment to be 
sequenced being between the transposon and the second primer site in the target vehicle. 

In accordance w ith the above-described embodiment, the invention is also directed 

20 to kits for performing transposable element insertion in vitro. As described, such insertions 
can be used to provide priming sites for DNA sequence determination or to provide 
mutations suitable for genetic analysis or both. 

Hssential components in the kit are gene products allowing transposition that are 
normally encoded on the transposable element or their functional equivalents. A further 

25 component is a transposable element donor vehicle. This nucleic acid vehicle provides the 
transposable element to be inserted into a given specific target. The transposable element 
donor is preferably DNA but could encompass RNA. being operable via a cDNA copy. 
Preferred DNA vehicles include, but are not limited to. bacterial plasmids. Other vehicles 
include any DNA that can be isolated in super coiled form or placed into a super coiled 
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configuration by the use of topoisomerases. for example, bacteriophage^ DNA. 
autonomous]} replicating molecules from eukaryotes or archae. or synthetic DNA that can 
be ligated to form a topologicals closed circle. 

Optional components of the kit include one or more of the following: ( 1 ) buffer 

5 constituents. (2) control target plasmid. (3 ) sequencing primers. The buffer can include any 
buffer suitable for allowing the transposition activity to occur in vitro. A preferred 
embodiment is HEPHS buffer. A specific disclosed embodiment is included in the 
exemplar) material herein. 

Preferred donor plasmids do not need to be destroyed before introducing 

10 transposition products into commonly used bacterial and preferably coli strains. These 
vectors do not replicate without regulatory genes not provided by the host cell which allow 
a functional replication origin. An example is the pir gene which is present only in 
specially constructed strains, having been derived from the plasmid R6K. In this way. 
artifactual background consisting of cells transformed w ith both the donor DNA and the 

15 target DNA without any transposition having occurred is eliminated. As discussed herein, 
there are other ways to do this such as restriction digestion of the donor DNA but not of the 
target or transposable segment or deletion and titration of the transposition reaction so that 
there are more cells than DNA molecules in the transformation step. However, these are 
not preferred. 

20 The control target plasmid does not contain the transposable element and does 

contain transposable element integration site. The purpose is to assure that the reaction is 
not inhibited by a contaminant in non-kit ingredients (introduced by the kit user); i.e. it 
ensures that all components allow optimal insertion. 

Sequencing primers include, but are not limited to primers that have homology with 

25 both ends of the transposable element and. as such, allow sequencing to proceed 

bidirectionally from the ends of the transposable element. However, primers could be 
made to any area within the transposable element or within the target vehicle itself as long 
as extension is allowed into the DNA segment to be sequenced. Kits designed for allowing 
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sequencing by the PCR reaction may also include a second primer that allows the 
amplification of the sequence between the first and second primers. 

The control target plasmid preferably contains a selectable marker tor recovery of 
the desired DNA segment from a specific host cell. It is understood that, when using the 
5 kit. the target DNA does not earn the same selectable marker as the control target nucleic 
acid. 

A fourth optional component of a kit is target DNA itself. Target DNA that might 
be desirable would include but is not limited to purified chromosomal DNA. total cDNA, 
cDNA fractionated according to tissue or expression state (e.g. after heat shock or after 
10 cytokine treatment other other treatment) or expression time (after any such treatment) or 
dev elopmental Mage, or plasmid, cosmid, BAC YAC or phage library of any of the 
foregoing DNA samples, especially such target DNA from important study organisms such 
as Homo sapiens, \fits domesticus, Mus spretus. Cam's domesticus. Bos, Caenorhabditis 
elegans, Plasmodium falciparum, Plasmodium vivax. Onchocerca volvulus, Brugia malayi, 
15 Dirofilaria immitis, Leishmania, Zea maize, Arabidopsis thaliana, Glycine max, 

Drosophila melanogaster, Saccharomyces cerevisiae, Schizosaccharomyces pombe, 
Xeurospora, Escherichia coll Salmonella typhimurium, Bacillus suhtilis, Xeisseria 
gonorrhoeae. Staphylococcus aureus. Streptococcus pneumonia, Mycobacterium 
tuberculosis, Acptifex, Thermits aquaticus, Pyrococcus furiosus. Thermits littora/is, 
20 Methanobacterium thermoautotrophicum, Sulfolobus caldoaceticus, and others. 

Other suitable selectable markers include chloramphenicol resistance, tetracycline 
resistance, spectinomycin resistance, streptomycin resistance, erythromycin resistance, 
rifampicin resistance, bleomycin resistance, thermally adapted kanamycin resistance, 
gentamycin resistance, hygromycin resistance, trimethoprim resistance, dihydrofolate 
25 reductase (D1IFR), OPT; the IJRA3, HIS4. IMV2. and TRP1 genes of S. cerevisiae. 

There may be certain instances in which it is desired to introduce primer binding 
sites other than those naturally found in the transposablc element or in the insertion vehicle. 
In this case, the transposablc element can be used as a vehicle for introducing any desired 
primer or primers. An example of w hen the use of exogenous primers may be desirable is 
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the ease in which the transposable element ends form a secondary structure that interferes 
with sequencing, or cases in which there is a similarity of sequence between the two ends 
of the transposable element, and cases in which the only practical binding sites in the 
transposable element are so far internal that they undesirably curtail the amount of 
5 nucleotides that can be sequenced from that site. 

The invention also generally encompasses compositions containing an A I In- 
dependent DNA binding protein encoded by a transposon, the protein containing a mutation 
conferring reduced target site specificity, preferably random target site insertion. 

The protein is isolated from a biological preparation produced in vivo or in vitro. 
10 Thus, the protein is purified or substantially purified from cellular components with which 
it is found in vivo. When produced in vitro, the protein may also be purified or 
substantially purified from the other components used to produce it. 

In preferred embodiments the protein is the TnsC protein (SHQ ID NOS: 1 and 2). 

In a specific disclosed embodiment, the protein contains a valine at amino acid 
15 number 225. 

The invention is also directed to compositions containing the protein described 
herein and the transposable element substrate on which the protein acts to cause insertion. 

Compositions can also include target DNA into which the transposable element is 
capable of being inserted. 

20 The mutant proteins of the present invention include the naturally occurring proteins 

encoded by a transposon as well as any substantially homologous and or functionally 
equivalent variants thereof. By "variant" protein is intended a protein derived from the 
native protein by deletion (so-called truncation) or addition of one or more amino acids to 
the N-terminal and or ('-terminal end of the native protein; deletion or addition oWmc or 

25 more amino acids at one or more sites in the native protein; or substitution of one or more 
amino acids at one or more sites in the native protein. Such variants may result from, for 
example, genetic polymorphism or from human manipulation. Methods for such 
manipulations are generally known in the art. 
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For example, amino acid sequence v ariants of the polypeptide can be prepared by 
mutations in the cloned DNA sequence encoding the native protein of interest. Methods for 
mutagenesis and nucleotide sequence alterations are well known in the art. See. for 
example, (37) (38) (39) (40); U.S. Patent No. 4.873.192; and the references cited therein; 

5 herein incorporated by reference. Guidance as to appropriate amino acid substitutions that 
do not affect biological activity of the protein of interest may be found in the model of 
Dayhoff?/ al. (1978) (41 ) in Atlas of Protein Sequence and Structure (Natl. Biomed. Res. 
Found.. W ashington. D.C.). herein incorporated by reference. Conservative substitutions, 
such as exchanging one amino acid with another having similar properties, may be 

10 preferred. 

In construe In m variants of the protein of interest, modifications to the nucleotide 
sequences encoding the variants will be made such that variants continue to possess the 
desired activity. Obviously, any mutations made in the DNA encoding the variant protein 
must not place the sequence out of reading frame and preferably w ill not create 

15 complementary regions that could produce secondary mRNA structure. See HP Patent 
Application Publication No. 75.444. 

Thus nucleotide sequences of the invention and the proteins encoded thereby 
include the naturally occurring forms as well as variants thereof. The variant proteins will 
be substantially homologous and functionally equivalent to the native protein. A variant of 

20 a native protein can be "substantially homologous** to the native protein when at least about 
80%. more preferably at least about 90%, and most preferably at least about 95% of its 
amino acid sequence is identical to the amino acid sequence of the native protein. 
However, substantial homology includes high homology in the catalytic or other conserved 
functional regions with possible low homology outside these. By "functionally equivalent" 

25 is intended that the sequence of the v ariant defines a chain that produces a protein having 
substantially the same biological effect as the native protein of interest. Thus, for purposes 
of the present invention, a functionally equivalent variant w ill confer the phenotype of 
activating transposition with reduced target site specificity, preferably random. Such 
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functionally equivalent variants that comprise substantial sequence variations are also 
encompassed by the invention. 

The invention also encompasses compositions containing a transposable element 
containing DNA sequence encoding an A TP-utilizing regulator) protein, the protein 
5 containing a mutation that confers reduced target site specificity and preferably random 
insertion. 

In preferred embodiments of the invention, the transposable element is a Tn7 
transposable element. 

In specific disclosed embodiments, the mutation is valine as amino acid number 225 
10 in the TnsC protein. 

The invention also encompasses compositions containing the above-described 
transposable element and a given DNA segment intended to be the target for insertion of 
the transposable element. 

The invention, accordingly, is directed to DNA into which has been inserted the 
15 transposable element containing the mutation described herein that confers simple, efficient 
insertion with reduced target site specif city or random target site insertion. The DNA in 
this composition, in one embodiment, is capable of being introduced into a cell in which it 
can exist as an extrachromosomal element or as an integration element into cellular DNA. 
The invention is also directed to DNA segments encoding the mutant proteins 
20 disclosed herein, vectors containing these segments and host cells containing the vectors. 
The vectors containing the DNA segments may be used to propagate (i.e. amplify) the 
segment in an appropriate host cell and 'or to allow expression from the segment (i.e. an 
expression vector). The person of ordinary skill in the art would be aware of the various 
vectors available for propagation and expression of a cloned DNA sequence. In a preferred 
25 embodiment, a DNA segment encoding mutant TnsC protein is contained in a plasmid 

vector that allows expression of the protein and subsequent isolation and purification of the 
protein produced by the recombinant vector. Accordingly, the proteins disclosed herein can 
be purified following expression from the native transposon. obtained by chemical 
synthesis, or obtained by recombinant methods. 
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Relevant compositions, accordingly, include expression vectors for the mutant 
protein alone or in combination with expression vectors for the other proteins necessary for 
insertion of a transposahle element. Such compositions may further comprise the 
transposable element to be acted upon by the proteins. Such mixtures are useful for 
5 achieving in vivo insertion, among other things. 

The invention further encompasses kits containing the above-described 
compositions. 

Tn7 can be obtained as strain A I CC 29181: a K- 12 derivative carrying the 
resistance transfer factor R483; originally identified as carrying a transposon in Barth et al. 
10 ./. Bacterial. 725:800-810 (1976). The sequence of Tn7 is Genbank entry ISTN7TNS, 
Assession no. XI 7693; reported in F lores et al. Nucleic Acids Res. 1<S:90\-] 1 (1990). 

{laving now generally described this invention, the same will be further described 
by reference to certain specific examples which are provided herein for purposes of 
illustration only and are not intended to be limiting unless otherwise specified. 
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EXPERIMENTAL 

Example 1 

Materials and Methods 
5 Media, chemicals, and enzymes: LB broth and agar were prepared as described 

(42). Trimethoprim selection was on Isosensitest agar (Oxoid). Lac phenotypes were 
evaluated on MacConkey lactose agar (Ditto). Antibiotic concentrations used were 100 
//g nil carbenieillin (Cb). 30 //g/ml chloramphenicol (Cm). 7.5 //g/ml gentamycin (Gn). 50 
//g/ml kanamycin (Km). 10//g/ml nalidixic acid (Nal). 20 //g/ml tetracycline (let) and 100 
10 //g/ml trimethoprim (Tp). Hydroxylamine was purchased from Sigma. UNA modifying 
enzymes were purchased from commercial sources and used as recommended by the 
manufacturer. 

Bacterial strains, phages and plasmids: BR293 is E. coli F A(lac-pro) thi rpsi 
M aal - )Xi) + iacZ pL d+434 pRS 7 (43) (44). BR293 is identical to NK8027 (45), and was 

15 provided by Nancy Kleckner. NLC5 1 is E. coli F~ araD139 A( areF-lac ) U169 rpsLlSQ 
relAl QbB5301 deoCl ptsF25 rbsR Val K recA56 (46). CW5 1 is E coli F" ara arg Mac- 
proXIll recA56 Nal K Rit^(ll). >.KK1 is lambda 780 hisG9424 ::Tnl 0 del 16 
dell 7: \attTn ~::miniTn7-Km K (47). Ins transposition proteins were provided by pCW15 
( tnsABC ). pC\V23 ( tnsD ), pCW30 ( tnsE ), or pC \V4 ( tnsABCDF ) (11). Target plasmids 

20 were derivatives of pOX-G, a conjugable derivative of the F plasmid that carries Gn (48). 
p()X-att'nr carries a (-342 to -165) attTn~ sequence ( 16). The immune plasmid pOX- 
att'nr HP-1 ::miniTn7-Cm R was made by transposing miniTn7-Cm K (47) onto pOX-attTn" 
using TnsABC -F to direct the insertion into a non-<///7//~ position. Construction of the 
immune target plasmid p()\-G::miniTn7-<//?/r is described below . The transposon donor 

25 plasmid for the papillation assay was p()X-G::miniTn71ac. containing promotcrlcss lac/Y 
between the transposon ends (50). The high copy transposon donor for mating-out assays 
was pFMA. containing mini Tn7-Km R (23). 
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Manipulation and characterization of DNA: Phage and plasmid isolation, 
transformation, and standard cloning techniques were performed as described in (40). 
Conjugation and PI transduction were performed as described in (42). DNA sequencing 
was done on an automated ABI sequencer. Two plasmids were constructed in this work: 
5 ( 1 ) p()X-G::miniTn7-t//z//\ Mini \\\l_-dhfr was constructed by replacing the Km K cassette in 
pLAl (16) with a dhfr cassette from pSI)51 1 (28). which had been amplified by PCR to 
add Hanking Sail sites. The PCR fragment was ligated into the TA vector (Invitrogen), the 
dhfr cassette was then removed by Sai l digestion and inserted into the Sai l site of pLAl, 
replacing the Km R gene. The resulting plasmid was transformed into 

10 NLC5 1 +pCW4+pOX-G, and grown for several days to allow transposition to occur. pOX- 
G plasmids which had received a mini Ynl-dhfr insertion were identified by mating into 
CW51 and selecting for Tp R 

Mutagenesis of tnsC : The TnsABC plasmid pCW15 was exposed to 1M 
hydroxylamine hydrochloride in 0.45 M NaOH (final pll approximately 7.0) at 37 C for 20 

15 hours (ROSK et al. 1990). The DNA was recovered by multiple ethanol precipitations, and 
PvuII-Sphl fragments containing mutagenized tnsC were subcloned into untreated pCW15. 
replacing the wild-type TnsC (SLQ ID NO. 1 ) . These plasmids were then introduced into 
CW51 - p()X-G::miniTn 71ac by electroporation, and transformants were selected on 
MacConkey lactose plates containing Cm. The plates were incubated at 30 C for 3-4 days, 

20 and screened for the emergence of Lac papillae, indicating transposition of mini Tn 7 lac . 

>- hop transposition assay: Tn7 transposition was evaluated in NLC51 strains into 
which tns functions were introduced by transformation, and pOX-G was introduced by 
conjugation (for f igure 5). The protocol of (47) was followed: Cells were grown in LB and 
0.2° o maltose at 37^C to an OD w >o of 0.4-0.6 and then concentrated to 1 .6 x 10 cells -'ml bv 

25 centri fugation and resuspension in 10 m\l MgS0 4 . 0.1 ml cells were combined with 0.1 
ml aKKI containing miniTn7-km R at a multiplicity of infection of 0.1 phage per cell. The 

infection proceeded for 15 min at 37 C. and was terminated by the addition of 10 m\l 
sodium citrate in 0.8 ml LB. Cells were allowed to recover with aeration for 60 minutes at 
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37 t\ and then spread on plates containing Km and citrate. Transposition frequency is 
expressed as the number of Km K colonies pfu of /.KK1. 

Mating-out transposition assay: Tn7 transposition was evaluated in the 
derivatives of BR293 used to monitor SOS induction (Table 3). or in NLC51 strains into 

5 which tns functions were introduced by transformation, and pOX-Ci or p()X-G::miniTn7- 
c/hfr were introduced by conjugation. Mini Tn7-km R was present in the NLC51 strains 
either in the chromosomal attTn" site (Figure 4 and Table 1 ) or the high copy plasmid 
pHMA (Table 2) (SKQ ID NO:4). The protocol was adapted from (11): The donor strains 
described above and the recipient strain CW5 1 were grown at 37 to an OD 6 oo of 0.4-0.6 

10 with gentle aeration. Donors and recipients were mixed at a ratio of 1 :x and growth was 
continued for another hour. Mating was disrupted by vigorous vortexing, and the cells 
were diluted and plated. The total number of exconjugants was determined by selection on 
GnNal plates. Tn7-containing exconjugants were selected on TpNalplates. and miniTn7- 
Km R exconjugants were selected on KmNal plates. Transposition frequencies are 

15 expressed as the number of Tp R - or KLm R -exconjugants/total number of exconjugants. 



Results 

Isolation of the TnsC gain-of-function mutants: To focus on the relationship of 
TnsC and the target DNA. the inventor isolated gain-of-function TnsC mutants that 

20 activated the TnsA+B transposase in the absence of TnsD or TnsK. Since overexpression 
of wild-type TnsC does not relieve the requirement for TnsD or TnsH (11), these gain-of- 
function mutations were predicted to affect the biochemical properties of TnsC. rather than 
its expression or stability. 

A visual assay for Tn7 transposition (50) (51 ) was used to identify mutants. This 

25 assay uses a mini l n 71ac element which carries promotcrlcss lae/Y genes between the cis - 
acting sequences at the transposon ends. The mini Tn 71ac element is located in a 
transcriptionally silent position on a donor plasmid; cells containing this plasmid are 
phcnotypically Lac". When Tns functions are prov ided in trans. miniTn71ac can transpose 
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to new sites in the /:. coli chromosome. Some of those transposition events place the 
element downstream from active promoters, resulting in increased lac/ expression. This is 
observed on MacConkcy lactose color indicator plates as the emergence of red (Lac ) 
papillae in an otherwise white (Lac") colony. Therefore, the number of papillae reflects the 
5 amount of transposition w hich occurred during the growth of that colony. 

Cells containing miniTn 71ac and various Ins functions were patched on color 
indicator plates (Figure 1 ). Virtually no Lac papillae were seen in cells containing only 
TnsABC ut . Cells containing TnsABC ut +H produced many Lac^ papillae. Southern 
blotting demonstrated that TnsABC NSt +L papillae result from translocations of mini Tn 71ac 
10 to a variety of chromosomal locations rather than from intramolecular rearrangements of 
the donor plasmid (50). Most TnsABC wl +D events are silent because there is no 
appropriately oriented promoter adjacent to at(Tn~ (50) (52). 

This visual assay was used to screen for TnsC mutants that had acquired the ability 
to activate Tn7 transposition in the absence of TnsD or TnsE. Randomly mutageni/ed tnsC 
15 was cloned into a plasmid containing tnsAB . These tns genes were introduced into cells 
containing mini Tn 71ac . Six gain-of- function TnsC mutants were identified (figure 1 ). 

Transposition activated by these TnsC mutants still required the TnsA+B 
transposasc and intact transposon ends. The papillation phenotypes of the TnsC 1 mutants 
varied considerably, suggesting that different mutants were activating different amounts of 
20 miniTn 71ac transposition. Several TnsC mutants promoted more transposition than 
TnsABC wt +H. TnsABC s40lY A4n2 achieved the highest level of transposition. 

The amino acid changes responsible for the mutant phenotypes were determined by 
DNA subcloning and sequencing. tnsC encodes a protein of 555 amino acids, with Walker 
A and B motifs in the amino-terminal half of the protein (53). Walker A and B motifs have 
25 been implicated by structural and mutational analyses to be direct!) involved in nucleotide 
binding and or hydrolysis in a variety of A I Pases and GTPascs (37) (55). 

The tnsC mutations primarily result in single amino acid substitutions whose 
locations are scattered across the TnsC protein sequence (figure 2). TnsC mutants 
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segregate into two phenotypic classes. Transposition reactions activated by Class I mutants 
are sensitive to immune targets and the target selection factors TnsI) and TnsI;. 
Transposition reactions activated by the Class II mutants are impaired in their responses to 
these signals. The residues affected in two of the mutants (TnsC A " : X and TnsC 1 ~ K>K ) He 
5 in or very close to the Walker B motif. 

TnsC mutants promote intermolecular transposition: The papulation assay is a 
powerful screen for transposition activity, but it does not necessarily report intermolecular 
transposition events. Internal rearrangements of the mini Tn 71ac donor plasmid, w hich 
fortuitously place the mini l n 71ac element downstream from a promoter, would also 

10 produce Lac papillae. Therefore, the inventor investigated whcthei the TnsC mutants 
facilitate the TnsA+B transposase to do intramolecular recombination, or whether the 
mutants promote intermolecular transposition. 

The X hop assay measures the translocation of a miniTn7-Km R element from a 
replication- and integration-defective X phage to the bacterial chromosome during a 

15 transient infection. The miniTn7-Km element carries a kanamyein resistance cassette 
with a constitutive promoter. Therefore, the A hop assay reports the total number of 
transposition events occurring into the chromosome. TnsABC ul had no detectable 
transposition activity in the k hop assay. TnsABC UI +T generated 2.2 \ 10" 7 Km R 
colonies/pfu (Figure 3). Transposition promoted by TnsABC ut +D generated 1.8 x 10 4 

20 Km R colonies/pfu. All of the TnsC mutants could promote the translocation of mini l n7 - 
Km R . TnsABC A "" ?N and TnsABC S4<m A4(L promoted 8- and 50-fold more transposition 
than TnsABC" l -»-I:. Other TnsC mutants promoted transposition, although not at such 
levels. 

The mating-out assay was used to explore the ability of the TnsC mutants to 
25 promote translocations into a different type of target molecule. This assay measures the 

frequency of transposition of miniTn7-Km k from the chromosome to p()\-G. a conjugahle 
derivative of the E. coli V factor. The TnsABC u * I: machinery preferentially selects 
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conjugablc plasmids as targets for transposition, whereas the I nsABC ut 4) machinery does 
not recognize p()X-G unless it contains aitTn" sequences (ROGFRS ct at. 1986. 
WADDHFF and CRAIG 1988. WOLKOW et al 1996). The TnsC mutants could promote 
transposition to pOX-G (Figure 4). Thus, the results demonstrate that the gain-of-f unction 
5 TnsC mutants can promote intermolecular transposition. 

Effects of the target selection factors TnsD and TnsF: Frequencies of 
transposition of miniTn7-Km R from a lambda phage to the chromosome and/or p()X-G 
were measured in strains containing TnsA+B and the TnsC mutants, either alone or in 
combination w ith TnsD or TnsE. The preferred target for TnsF reactions, pOX-G, was 

10 introduced by conjugation into strains containing TnsC mutants or the TnsC mutants + 
TnsE. The distribution of miniTn7-Km R insertions between the chromosome and pOX-G 
w as determined by mating the pOX-G plasmids from the Km R products of a lambda hop 
assasy into the Km' strain CW51, and testing whether Km resistance was plasmid linked. 
No transposition was detected in strains containing TnsABC" 1 alone or TnsABC* 23 >K + 

15 TnsD. 

Response to the target selectors TnsD and TnsE: TnsD and TnsE are required to 
activate the TnsABC vvt machinery and to direct transposition into particular target DNAs 
( 10) ( 1 1 ) ( 1 3) ( 14). The TnsABC mutanI machineries, by definition, do not require the inputs 
of TnsD or TnsE. Howev er, the inventor inv estigated whether TnsD or TnsF could 
20 influence the frequencies or distribution of transposition ev ents promoted by the TnsC 
mutants. 

The K hop assay was used to evaluate the effects of TnsD and an available aitTn~ 
site on transposition promoted by the TnsC mutants. All of the mutant reactions were 

responsive to TnsD+<///7/7"\ but those responses varied widely. Reactions activated by 

W5V r "* ~* ^ K 

25 TnsABC ~~ and TnsABC were strongly stimulated by TnsD+<///7//~ promoting 

500- and 5000- fold more transposition, respectively, in the presence of TnsD+<///7>7~ than 

with TnsABC or TnsABC " * alone. The remaining mutant reactions were less 

profoundly influenced by TnsD: TnsABC* 4011 reactions showed a moderate (50-fold) 
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stimulation. Reactions activated by I nsC , InsL and Inst were 

somewhat inhibited in the presence of TnsD. 

The effects of TnsH was also studied by the a hop assay. In the absence of TnsH, 
the vast majority of the TnsABC nuium transposition events w ere targeted to the 
5 chromosome. In the presence of TnsH. preferential insertion into p()X-Ci was observ ed 
w ith some of the TnsC mutants. 

These differential responses suggest that the six TnsC mutants are not activating 
Tn7 transposition through a single mechanism. Instead, the mutants can be segregated into 
two classes, based on their ability to respond to TnsD and TnsH. Transposition activated by 
10 the Class I mutants - TnsC A ~ 2:>V and TnsC 1 '" 7 ^ - can by humiliated by TnsD and targeted 
to pOX-G by TnsE. Transposition activated by the Class II mutants - TnsC L " 33K , 

. r r ,A282I ~ ^S401YA4()2 , ^S401F . . . , v . r 

I nsC , TnsC and 1 nsC — is not responsive to the positive effects of 

TnsD or TnsI: or both. By these criteria. TnsC S40l! is proposed to be a member of Class II: 
although TnsC S4(HI -activated reactions are somewhat stimulated by TnsD. the distribution 
15 of insertions in TnsC S4011 -activated reactions is not affected by TnsI:. The grouping of the 
TnsC mutants into these two classes is supported by the differential responses of the 
TnsABC mulJnl reactions to immune targets, as described below. 



Discussion 

20 

Proteins involved in target evaluation: How is an appropriate target for Tn7 
transposition identified? The inventor has hypothesized that TnsC may serve as a 
"connector"" or "matchmaker*", linking the transposase and the target DNA in a manner 
regulated by the ATP state of TnsC (23) (27). TnsC 1 has the biochemical properties 
25 necessary for that connection: it can directly interact with target DNA (24) and with the 
Tns.VB transposase (A. STHITAVAGHN and X. H. CRAIG, unpublished results). 
However, wild-type TnsC (SHQ ID NO:l ) is not sufficient to activate transposition. 
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Instead. Tn7 transposition is dependent on TnsD or TnsE to activate the TnsABC 
machinery and select a target site. TnsI) is an aliTn~ binding protein (23) which recruits 
TnsC to this target. The resulting TnsC-TnsD-<///7>?~ complex can then attract the 
transposase in vitro (23). The mechanism by which TnsI; activates transposition is not yet 

5 known. TnsI: might be preferentially localized to conjugating plasmids and subsequently 
recruit TnsC to those molecules, or I'nsI: might modify TnsC so that TnsC's binding 
activity is now directed to those targets. Alternatively. TnsI: might modify the transposase 
directly, without proceeding through TnsC. The results suggest that TnsI) and TnsE 
provide alternative inputs into TnsC. which in turn recruits the TnsA+B transposase to the 

10 target DN A. 

T he successful isolation of TnsC gain-of- function mutants reveals that the 
TnsABC machinery is capable of engaging target DNA and promoting insertions w ithout 
TnsD or TnsE. However, the mutant reactions have not mimicked the abilities of TnsD 
or TnsE to direct transposition into particular targets: transposition activated by the TnsC 

15 mutants does not show the preferential insertion into conjugable plasmids seen with 
TnsE-activated reactions, nor the attTtv specificity of TnsD-aetivated reactions. 
Therefore. TnsD and TnsI: are essential to recognize these positive target signals. 

TnsC appears to receive a variety of inputs -- from TnsD. TnsI: and from immune 
targets -- which control its activity. The activity of TnsC can also be influenced by 

20 mutation. Six gain-of-function TnsC point mutants have been described in this work, 
which segregate into two classes. The fact that different classes of TnsC mutants w ith 
different transposition activities were recovered is consistent with the hypothesis that there 
are multiple routes to activating TnsC. The Class 1 mutants. TnsC " and TnsC 
enable the TnsABC machinery to execute transposition without sacrificing its ability to 

25 respond to both positive and negative target signals. Both are substantial gain-of-function 
mutants, with TnsABC' promoting eight-fold more transposition to the chromosomes 
than TnsABC ul - 1: (f igure 3). Transposition activated by these Class I mutants can be 
profoundly stimulated by TnsD ^///7)?~. or directed to conjugable plasmids by TnsI:. as 



R I V»l : 1mS5P\ i 



41 



\\w okt \o ^~xw :-4is:> i>"so-v-\i 




well as being able to discriminate between immune and non-immune targets. Thus, the 
gain-of-function phenotypes seen with the Class I mutants have been achieved while 
preserv ing the ability of these TnsCs to transduce information betw een the target DNA and 
the transposase. 

5 The TnsC mutants which fall into the second class behave much more like 

constitutively activ ated v ersions of TnsC. Some of these mutants also promote 
considerable amounts of transposition: TnsABC S4<M VU °~ results in 50-fold more 
transposition to the chromosomes than TnsABC ul +K (Figure 3). However, the nature of 
the transposition reactions promoted by the Class II TnsC mutants is quite different than 

10 those seen with the Class I mutants. Immune and non-immune targets are used essentially 
equivalently in reactions with the Class II mutants, and TnsD and TnsE are not able to 
profoundly influence the frequency or distribution of these transposition events. A similar 
loss of responsiveness to target signals is seen when Tn7 transposition is activated by 
nonhydrolyzable A I P analogs in vitro. Transposition can still occur when TnsCs ATPase 

15 activ ity is blocked with AMP-PNP, but those transposition events no longer require TnsD 
and are no longer targeted to attTn~ (BAIN I ON et at. 1993). Instead, any DNA molecule, 
including immune targets, can serve as a target for Tn7 insertion. Thus, TnsABC 
transposition can be constitutively activated by AMP-PNP or by the Class II TnsC mutants. 
It is noteworthy that the amino acid affected in TnsC 1 ~ >K lies in one of TnsCs A TP 

20 motifs. 

Comparison to other elements: The use of an ATP-dependent protein such as 
TnsC to regulate target site selection is not unique to Tn7. Bacteriophage Mu transposition 
is also profoundly influenced by its ATP-utili/ing protein MuB. MuB is an ATP- 
dependent DNA binding protein (57) (MAXWKLL ct ai 1987) which is required for 
25 efficient transposition in vivo (58) (59). In vitro, the MuA transposase preferentially directs 
insertions into targets that are bound by MuB (60) (61 ) (19). Although there is no 
particular sequence specificity to MuB binding, its distribution on DNA is not random: 
MuB binding to target molecules that already contain Mu sequences is specifically 
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destabilized through an A I ^-dependent mechanism (19). Therefore Mil like Tn7. 

\22^ V 

recognizes and avoids immune targets: moreover. MuB and I nsC appear to play 
functionally similar roles in regulating transposition. 

Mu and Tn7 belong to a family of transposons which encode proteins with ATP 

5 binding hydrolysis motifs; other members include IS2J_ (35) (62). Tn552 (36), Tn5053 (33). 
and Tn 5()90 (34). Therefore, the strategy of using an ATP binding protein to regulate target 
site selection may extend to the entire family. Tn 5053 is particularly interesting, since it 
encodes three proteins which are required for its transposition: a presumptive transposase 
containing a D, D(35)E motif characteristic of transposases and integrases. a potential 

10 regulatory protein containing Walker A and B motifs, and a third protein of unknown 
function (33). Tn 5053 shows some degree of target site specificity, inserting 
predominantly into the par locus of the conjugable plasmid RP4. It is tantalizing to 
speculate that the third protein of Tn5053 is a target selector, like TnsD or TnsK, directing 
insertions into the par locus. 

15 The inventor's work has illustrated the role of target DNA in controlling Tn7 

transposition in vivo, and has strongly implicated TnsC as a central player in this regulation. 
Single amino acid changes in TnsC can disrupt the communication between the transposon 
and the target site, reducing the stringency of Tn7's target site selectivity. TnsI) promotes 
Tn7 insertion at high frequency into attTn~« a safe haven in the bacterial chromosome. 

20 whereas TnsI: allows Tn7 access to conjugable plasmids. and thus a means to spread 

through bacterial populations. Avoidance of immune targets also promotes the spread of 
the element, rather than local hopping, and prevents one Tn7 element from inserting into 
another. TnsC may integrate all of these target signals, and communicate that information 
to the transposase. 
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TABLE 1 

\*>*>5V 

TnsC " promotes intcrmolccular transposition 



Tns functions Transposition frequency 



nsABC ut < 10" 



TnsABC A225V 8.8(±8.1)x I0" 6 



TnsABC wt DE 5.5 (± 1.1) x 10" 4 



5 Frequencies of transposition of mini Tn7-Km from a high copy plasmid to pOX-G were 
determined using the mating-out assay, and are expressed as the number of Km R 
exconjugants/total exconjugants. Hach value is the average of three independent 
measurements. 

i o Example 2 

Materials and Methods 

Media, Chemicals, and Enzymes 

15 Luria broth (LB) and agar were prepared as described by (42). Carbenicillin and 

kanamycin selections were carried out at a concentration of 100|Ag/ml. DNA modifying 
and restriction enzymes were purchased from commercial sources and used according to 
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manufacturer's instructions. Taq polymerase was purchased from Boehringer Mannheim 
Biochemicals. 

Bacterial Strains and Plasmids 

5 Tn7 donor plasmids contain a miniTn7 element in which the minimal end 

sequences of Tn7 (Tn7L 1-166 and Tn7R 1-199) Hank a selectable marker. A pBR plasmid 
containing a mTn7- kanamycin element with XotI and Spel sites at the ends of the 
kanamycin cassette has been shown to be an effective donor. When transposition products 
are to be recovered by transformation, it is useful to prevent transformation of unreacted 

10 donor. One strategy is to cut the donor backbone with a restriction enzyme that does not 
cut within the Tn7 element or within the target DNA. Another strategy is to use donor 
plasmids that will not replicate with the products recovered. One strategy is to make the 
replication of the donor depend on a protein that is not present in the transformation strain. 
For example, the m Tn7 element can be placed on a plasmid which does not itself encode 

15 an initiator protein for replication. A particular example is to make the donor backbone an 
R6K plasmid that does not encode the replicator protein pir. The R6Kpir - mini Tn7 
plasmid can then be grown in a strain which contains pir (supplied for example by a 
heterologous plasmid) and the transposition mixture transformed into a strain lacking pir. 
With selection tor the marker on the mTn7. only insertions into the target DNA will be 

20 recovered. Subcloning Efficiency DH5alpha competent cells were purchased from GIBCO 
BRL and used according to the manufacturer's instructions. 

The target plasmid pRM2 (SHQ ID NO:6) contains bases -342 to +165 ofattTn~ 
cloned into pl'C18 [47]. The donor plasmid pHMA (SHQ II) NO:4) carries a mini In 7 
element comprised of the 166 terminal bases of the left end of Tn7 and 199 bases of the 

25 right end Hanking a gene conferring resistance to kanamycin [23 J. 
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Tns Proteins 

The purification of TnsA and TnsB-His are described in (63). TnsA was stored in 
25mM Hopes (pi I 8.0). 150 mM NaCl. ImM EDTA. ImM I) I T. 5% glycerol at -80°C\ 
TnsB was TnsB-His. a derivative containing a C -terminal polyhistidine tag. and was stored 

5 in 25mM Hepes (pH 8.0). 500 mM KCI, 2mM D i r. 1 mg ml BSA. 25% glycerol at -8()°C. 
The purification of TnsC and TnsC A " >N is a modified procedure from (24) which is 
described in (25) (26). Both proteins were stored in 25mM Hepes (pH 8.0), 1M NaCl. 
2.5mM DTT, ImM ATP, 10mM MgCK O.lmM EDTA. lOmM CHAPS, 10% glycerol at - 
80°C. TnsD was TnsD-His (P. Sharpe and N. Craig, in preparation), a derivative 

10 containing a C -terminal polyhistidine tag, and was purified by Nf : chromatography before 
being stored in 50mM Tris (pH 7.5). 2mM DTT, 500 mM KCI. ImM KDTA and 25% 
glycerol at -80°C. 



Transposition Reactions in vitro 

15 Transposition reactions are adapted from the standard in vitro reaction described in 

(23). Reaction mixtures. 100 \i\ in volume, contained (final concentration) 0.25 nM pHMA 
donor. 1.9 nM pRM2 target. 26 mM Hepes. 4.2 mM Tris (pH 7.6). 50 ng/ml BSA. 100 
Hg/ml yeast tRNA. 2 mM ATP (pH 7.0), 2.1 mM DTT. 0.05 mM IT) TA. 0.2 mM MgCk 
0.2 mM CHAPS, 28 mM NaCl, 21 mM KCI. 1 .35% glycerol. 60 ng TnsA, 25 ng TnsB. 

20 either 100 ng TnsC NNl , or 100 ng TnsC A22:A . and 40 ng TnsD. unless otherwise indicated, in 
a 30 minute preincubation at 30°C. (TnsA=19nM. TnsB=3.InM. TnsC=16nM, 
TnsD=6.5nM). Magnesium acetate was added to a final concentration of 15 mM and the 
reactions were allowed to proceed for an additional 60 minutes at 30°C. Products were 
extracted with a 1 : 1 mixture of phenol chloroform, ethanol-precipitated. and resuspended 

25 in water in preparation for subsequent analyses. 



PC R Primers and Amplification 

Oligonucleotides used for the various PCR amplifications to analyze the products of 
transposition are: 
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NLC95 (SKQ ID NO. 7): (5') - ATAA TCCTTAAAAACTCCA I T TCCACCCCT - (3') 
NLC209 (SI:Q ID N().<S): (5*) - GTGAT TGCACCGA TC T TC TACACCG I ICC - (3') 
NLC429 (SHQ ID NO. 9): (5') - T I I CACCGTCA I CACCCiAAACCiCGCGAGAC - (3') 
NLC430 (SLQ ID NO. 10): (5') - AA TGAC TTGG T TGAG TAG TCACCAGTCACA - (3') 

5 NLC431 (SEiQ ID NO.l 1 ): (5') - ATGAACGAAATAGACAGATCGCTGAGATAG - (3*) 
NLC432 (SFQ ID NO. 12): (5') - CAAGACGATAGTTACCGGATAAGGCGCAGC - (3') 
Two percent of the 100 jliI transposition reaction was used as the template in a given 
PGR amplification. 100 pg of plasmid pMCB20 was used when amplifying a marker 
product for size comparison on the high resolution denaturing gels. 30 temperature cycles 

10 of 94°G for 1.0 minute, 55°C for 1.5 minutes, and 72°C for 1.5 minutes were run for a!! 
amplifications, followed by a single 5 minute incubation at 72°C. The buffer composition 
and quantity of Taq polymerase recommended by the manufacturer (Boehringer Mannheim 
Biochemicals) were used for all reactions. PCR products were ethanol-precipitated, 
resuspended in water, and loaded on a high resolution denaturing gel. 



Probe Labeling 

Oligonucleotide probes were 5' end-labeled with [gamma-* ,: P] ATP substrate and 
bacteriophage T4 polynucleotide kinase for 45 minutes at 37°C. Labeled probes were 
separated from unincorporated label by size exclusion through a G50 Nick Spin Column 
20 (Pharmacia). 

High Resolution Denaturing Gels 

The resuspended PCR products were electrophoresed on either a 5°o or 6°o 
polvacrylamide denaturing gel and electrotransferred to Gene Screen Plus membrane (du 
25 Pont). The resulting blots were visualized by hybridization with an appropriate 
oligonucleotide probe at 50°C and exposed overnight to phosphorimager screens 
(Molecular Dynamics), which were scanned the following day. 
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Results 

TnsC A2:>N Supports Efficient Transposition in vitro 

A diagram of Tn7 transposition is shown in f igure 5. Tn7 mobilizes via a cut-and- 
paste mechanism. whereby both ends of the element are first excised from the donor 
5 backbone by double-strand breaks, and join to the target DNA most likely via 

transesterification reactions to form simple insertions with short gaps at either end. Other 
possible intermediates of a transposition reaction are double-strand breaks (DSBs). w here 
one end of the transposon has been excised but the other end remains attached to the donor 
backbone, excised linear transposons (ELTs). where both ends have been excised from the 
10 donor and neither end has joined to the target, and double-strand break, single-end joins 
(DSB-SCJs), where one transposon end has been broken in the donor and joined to the 
target molecule. 

The Tn7 transposition reaction has been reconstituted in vitro, in which purified Tns 
proteins promote the transposition of a mini Tn7 element from a donor plasmid into an 

15 at! 7/7 -containing target plasmid (Bainton 1993). TnsABC wt +D supports this reaction with 
great efficiency. In the absence of TnsD. TnsABC ut does not generate a detectable level of 
insertion products (Figure 6. lane 2) although double-strand break intermediates are seen 
upon prolonged incubation. By contrast, reactions containing TnsABC A2fc?N show a 
dramatic accumulation of simple insertions, at efficiencies that approach TnsABC ut H) 

20 reactions (Figure 6. lane 5). Neither the TnsABC A22:A nor the TnsABC VM +I) reactions 
generate visible levels of DSB-SFJ products, indicating that the vast majority of Tn7 
transposition events result in the complete (i.e., two-ended) insertion of the transposon into 
the target DNA. rather than a single-ended insertion event. 

TnsABC-I) transposition is not only efficient, it is also very target site-specific. 

25 TnsABO 1) insertions occur almost exclusively into the attTn" site present on the target 
plasmid (Bainton, ct <//., 1993. data not shown). By contrast, the TnsAB( ,A " x insertions 
are not limited to the attTn' site. Alternative restriction analysis of the TnsABC x " nX 
reaction yields a smear of products on an agarose gel (data not show n ), suggesthe of a 
population of insertions located at many different positions in the target plasmid. To 
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investigate the distribution of these insertions, w e subjected the TnsABC " reaction 
products to high-resolution analysis, as described below. 

Distribution of TnsABC A22>x -Mediated Insertions is Highly Nonspecific 

5 A PCR-based approach has been used to analyze inscrtional mutations in SV40 and 

yeast TRP1ARS1 minichromosomcs [30. 31 1, and perform functional analyses of 
inscrtional mutations in yeast chromosome V and the /:. coli supF gene [Smith. 1996 -427] 
and [32]. respectively. 

PCR was utilized to survey the distribution of TnsABC A22A insertions previously 

10 seen on the agarose gel at higher resolution. The diagram in Figure 7 illustrates the PCR 
strategy used to amplify the population of insertion products present in a TnsABC A22>v 
reaction, with two representative insertions being shown as examples. One PCR primer 
(NLC95) (SEQ ID NO:7) hybridizes within the cis-acting end sequence of the inserted 
element and the other (NLC209) (SEQ ID N():8) hybridizes to an arbitrary position in the 

15 target molecule. Thus, the length of the PCR product reflects the positions of the insertions 
into the target molecule. 

Amplification of a pool of insertions generated a smear of reaction products w hen 
displayed on an agarose gel. as expected (data not shown). The PCR products were run on 
a 6% polyacrylamide denaturing gel to achieve single nucleotide resolution and visualized 

20 by Southern blotting and hybridization with a Tn7-specific probe (Fig. 7). The striking 
result is that the distribution of products is remarkably nonspecific. Insertions have 
occurred at nearly every base w ithin the highly resolved lower portion of the gel. PCR 
products of more than roughly 200 bp in length are resolved poorly. Some areas of dense 
signal are seen in this region, potentially indicating preferential points of insertion. 

25 However, compression of bands could also account for the apparently singular products; 
analysis of these insertion products w ith other primer pairs supports this latter possibility 
(see below). 
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This confirms the inventor's hypothesis that the I nsABC machinery is capable 
of directing Tn7 transposition into the target plasmid with high efficiency and low 
specificity. 

5 Surveillance of the Entire Target Plasmid 

In the experiments above, the focus was on a relatively short region of the target 
plasmid pRM2 (SHQ ID N():6). It was demonstrated that I nsABC A22 ^ can direct 
insertions into virtually every base pair of this region. To be certain that the phenomenon is 
not specific to the region of the plasmid. a family of primers was synthesized, each of 

10 w hich paired w ith a Tn7 end-specific primer to allow amplification of all regions of pRM2 
(ShQ ID NO:6). These primers are spaced at approximately 500 bp intervals around the 
target plasmid and will amplify insertions in predominantly one orientation. Figure 8 
diagrams the amplicons for each primer pair and shows a denaturing gel Southern blot of 
the resulting PCR products. The results indicate that the C A22rA -mediated insertions do 

15 occur into positions all around the target plasmid. As was seen for the original amplicon 
analyzed, there is considerable variability in the strength of the signal for individual points 
of insertion, but insertions do occur at some level at even position. Thus, the 
TnsABC A22>N machinery does not appear to have a specificity for any particular region of 
this target plasmid. 

20 In another approach to investigating the possible sequence specificity of 

TnsABC A22rA target site selection. 67 independent insertions into a 12 kb plasmid were 
collected and analyzed. TnsABC A22:A transposition reactions using a target plasmid 
containing several /:. coli genes were transformed into /:. coli to select kanamycin-resistant 
colonies. The target plasmids were then recovered and sequenced to determine the position 

25 of each insertion. 62 out of the 63 insertions were located in different positions on the 

target plasmid. A comparison of the sequences of these insertions supported our previous 
observations that there is very little sequence specificity governing the selection of 
TnsABC A ~ 2>N target sites. Attempts to derive a consensus sequence for the 5 bp target site 
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duplication sequence revealed a faint preference for NYNRN (SL:Q ID NO: 14). but the bias 
is not very compelling. 

Exploiting the TnsABC A22= * Machinery for in vitro .Mutagenesis 

5 The high efficiency and low target specificity of the TnsABC A22:A transposition 

reaction makes this a useful system for mutageni/ing a variety of DNA targets. Insertional 
mutagenesis could be performed on cosmid libraries. cDNA libraries. PCR products, 
BACs. YACs. and genomic DNAs, among others. The inventor has mutagenized pl.JC- 
based plasmids. cosmids. BACs ranging in size from 5 to 120 Kb (data not shown), and //. 

10 influenzae genomic DNA (Gwinn et al.« 1997). In fact, the inventor has not encountered 
DNA that cannot serve as a target for TnsABC A22rA transposition. 

Once DNA targets have been successfully mutagenized in vitro, the simple 
insertions will be recovered. For a simple insertion product to become a stable replicon, 
the 5' nonhomologous overhangs trailing off both ends of the inserted transposon must be 

15 removed, the gaps filled in. and the strands ligated. A simple method to perform such 

processing functions is to transform the pool of transposition products into a host and rely 
on the host's repair machinery, selecting for a transposon-horne marker. In /:. colL the 5* 
single-stranded overhangs and gaps on either end of the transposon after a simple insertion 
are readily repaired by the host (see below). The donor plasmid for other hosts could be 

20 customized in a number of ways to best facilitate the recovery of the desired insertional 
mutants. 

The inventor recovered simple insertions into pRM2 in E. coli. since Tn7 insertions 
can be easily repaired in this host. 

Simply transforming transposition reactions into host cells as a method to recover 
25 simple insertion isolates presents a background contributed by donor molecules that have 
not undergone transposition and thus continue to earn the selectable marker on a stable 
replicon. In order to eliminate the background false positives that can complicate a screen 
for insertional mutants, the ability of the unreaeted donor to transform cells can be reduced. 
Two methods have been provided: 1 ) destruction of the donor plasmids ability to replicate 
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by restriction digestion prior to the transposition reaction, and 2) use of a conditional 
replicon origin in the donor backbone w hich renders the donor incapable of replication in 
the cells being ultimately transformed with the transposition pool. 

For the first method. 5 identical TnsABC A "" N reactions were carried out on 
5 linearized pMCB31 donor DNA paired with cosmid clone KS#3 target DNA. an 

approximately 50 kb replicon which contains an insert of genomic DNA from /:. tarda. 
Linearizing the plasmid will prev ent the donor plasmid from replicating once transformed 
into the host. The products were pooled for the extraction and precipitation steps, and then 
a portion of the resultant sample was transformed into BRL Subcloning Efficiency DH5- 

10 alpha competent cells. Assuming a 10% loss in the recover) of the DNA after the 

transposition reaction, the efficiency of transformation relative to jig of input donor DNA 
was approximately 3.8 X 10 4 colonies/|ig/ml of cells. One-tenth of a microgram of donor 
DNA is typically used in a reaction, so by extension, if all of the product DNA from a 
single transposition reaction is transformed, 3800 colonies could be isolated, an efficient 

1 5 mutagenesis. The DI 15 alpha cells are advertised to have a transformation efficiency of equal 
to or greater than 1 X 10 7 colonies/|ig supercoiled pUC19/ml of cells. Simply using higher 
efficiency cells or electroporation cells should yield considerably higher numbers of 
isolates from a single transposition reaction, and probably aid in picking up rarer events. 

Another method employs a heterologous origin of replication on the donor plasmid. 

20 for example. R6K. Replicons relying on this origin must be maintained in a host carrying a 
resident copy of the pir gene, w hich codes for the n protein, a necessary component for 
initiation of replication at R6K,- imma origins. Thus, it is simple to eliminate false positiv es 
stemming from unreacted donor molecules simply by transforming the transposition 
products into pir cells, and relying on the competent origin of replication in the target 

25 molecule for recov ery of simple insertion isolates. Transposition reactions employing this 
donor were prepared for transformation as described above. 

It is conceivable that the larger plasmid (-50kb) would be more difficult to 
transform after receiving an insertion because it would be a large open circular molecule 
approximately 10 times the size of the pRM2 (3.2 Kb) open circle with an insertion. To 
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gain insight into the possibility of target size limitations using the transformation method of 
simple insertion recovery, transpositions of the mini Tn7 element from pMCB4() into the 
tw o target plasmids w ere directly compared. The transformation efficiencies of the two 
reactions were very similar. The different targets were included at comparable 
5 concentrations in the transposition reactions, but were not equimolor. The results suggest 
that ES#3 simple insertions can transform the cells at nearly the same efficiency as the 
smaller pRM2 simple insertions. It is difficult to test reaction conditions under which the 
cosmid target is available at the same molarity as the pRM2 target because elevated levels 
of total DNA in the reactions can compromise the reproducibility with which DNA is 
10 recovered after transposition. 

The high transformation efficiencies demonstrate the utility of this reaction for a 
mutagenesis in which the simple insertion products can be stably replicated in an E. coli 
host. This same type of protocol could be used in other bacterial species and strains w ith 
development of the appropriate DNA substrates. 

15 

Discussion 

TnsC A22:A circumvents the requirement for a targeting protein. 

Tn7 demonstrates considerable diversity when it comes to target site selection. It 
20 has a sophisticated system for choosing either a highly conserved "safe haven" in the E. coli 
chromosome (<////)/ ~) or somewhat random sites throughout a cell's genome or resident 
conjugable plasmid. mediating these different selections via alternative targeting proteins 
encoded by the element. In this way. Tn7 is significantly different than all other well- 
characterized transposable elements, whose target site selections are mediated 
25 predominantly by either the transposase alone (e.g.. ISlO/TnlO) or in conjunction with one 
other accessory protein (bacteriophage Mu). IS 1 0 I n 1 0 selects a target site via a direct 
interaction of the I n 10 transposase with the target DNA. It has been demonstrated that 
particular mutations in the Tn 10 transposase are capable of altering target recognition 
features w hile leaving other functions of the transposase unaffected (65). The 



Ri.\n| 2|oXMa i 



53 



,\it\ Dkt \»» s >> m S" : i^xn.; \. 




bacteriophage Mil. however, encodes a transposase, MuA. and an ATP-dependent activator 
of MuA. MuB. MuB functions as an accessor) protein that, when complexed with target 
DNA. attracts the MuA transposase to the site of insertion. It is likely that hav ing more 
proteins involved has allowed Tn7 to he more adaptive to env ironmental changes when 
5 choosing its new sites of residence, and ensured its survival by enabling it to employ a 
more tailored approach to disseminating itself amongst v arious cell populations. 

This example has focused on the role of TnsC in the selection of a target site. As 
discussed. TnsC has been implicated as the major communicator between the TnsAB 
transposase bound to donor DNA, and the TnsD or TnsE targeting proteins, complexed 

10 with target DNA. Experiments have shown that TnsC does have the capacity to hind DNA 
nonspecifically in the absence of TnsD and TnsE (ref) but attempts to isolate simple 
insertions in vivo and in vitro in the absence of the targeting proteins proved unsuccessful 
with w ild-type TnsC (23). Isolation of the TnsC A22>v mutant, however, has permitted the 
inventor to circumvent this requirement and isolate simple insertions from reactions lacking 

15 TnsD and TnsE. Not only does the mutant facilitate the recovery of simple insertions, it 
does so very efficiently. 



Ability of TnsC A22rA to Insert Nonspecifically 

20 It is clear that TnsC A22?N has a considerable gain of function over wildtype TnsC. as 

evidenced by the increased yield of simple insertions in a standard in vitro reaction (25) 
(26) (this example). A more detailed evaluation was necessary to determine the actual sites 
of insertion because restriction digests of the product pools indicated that there is extensiv e 
variability in site selection relative to TnsD-mediated insertions, which are targeted almost 

25 exclusively to the attachment site. PCR amplification of pools of transposition products 
followed by high resolution denaturing gel analysis of sev eral independent reactions has 
revealed that the insertions into the pRM2 target plasmid are detectable at ev ery base 
v isible w ithin the well-resolv ed portions of the gels. Although the target site selection is 
not completely random (there arc differences in band intensities), one possibility is that the 
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nonspecific DNA binding activity of TnsC has been enhanced in the TnsC mutant, 
giving the protein the capacity to direct the TnsAB transposase to the wide variety of 
insertion sites observed. 

It is possible that the TnsC A2 ~ N mutation has altered TnsC in such a way that it 
5 simulates a TnsC-TnsK complex, capable of insertions at more random sites. Perhaps the 
role of TnsK is to strengthen TnsC's nonspecific interaction with the target DNA. thereby 
promoting insertions into sites where TnsC and TnsI* happen to complex. The ability of 
TnsE to preferentially direct transposition to conjugating plasmids ( 14) holds true when 
TnsC A225V is substituted for wild-type TnsC (Sl£Q ID NOSrl and 2) (25) (26). This 

10 suggests that this mutation in TnsC does not compensate for a!! specific activities of a 

targeting protein. The TnsABC /X22?v reaction is also sensitive to the presence of the target 
site specific protein TnsD. as evidenced by a detectable increase in the frequency of 
insertions when TnsD is present. 

These observations may explain why Tn7 has chosen to preserve a more 

15 complicated target site selection mechanism. In a cell containing only wildtype proteins, an 
extra layer of regulation can be exercised when two proteins complex to direct insertions, 
and the result may be less deleterious to cell populations than the somewhat rampant levels 
of insertions observed in reactions w ith the TnsC A::A in the absence of targeting proteins. 
Occurrence of a mutation like TnsC A "' x in nature w ould decrease the specificity and 

20 increase the frequency of insertions, the consequence of w hich could quite possibly be 
more insertions into essential genes. 

It is conceivable that TnsC has always played the primary role in directing the 
TnsAB transposase to insert, and the targeting proteins are more accessory. The inventor 
has envisioned TnsD binding DNA near the attachment site, and TnsC acting as an 

25 activation bridge to the transposase. but an alternative view is that the ability of TnsC to 
bind DXA plays a more central role in directing the donor complex to an insertion site, and 
TnsD has the role of "steering" a TnsAB- -TnsC complex to a particular point of insertion. 
The A225Y point mutation could confer the ability for TnsC to "steer" the donor complex 
to insert without the aid of a target-binding protein. 
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There is No Apparent Sequence Preference at the Point of Insertion 

Two main approaches have been taken herein to analyze the TnsC " -mediated 
5 insertions at nucleotide resolution. The first involves scanning along a short segment of 
DNA using PCR and high resolution denaturing gel analysis, quantitating specific signals at 
each base in a processive manner, and attempting to flush out a sequence motif common to 
those with the highest signals or lowest signals. The second method focuses on the 
recovery of the more frequent insertions only, those that can be recovered by simply 

10 transforming the transposition products, and relying on the host to conduct a successful 
repair of the replicons. These two methods provide different views of a common process. 
Since recovery of specific insertions is reliant of the process of transformation, rare 
insertion events that can be visualized by the PCR/denaturing gel method will most likely 
be severely underrepresented in a population of recovered transformants, if we assume that 

15 a higher concentration of a specific template will give rise to a diagnostic PCR product of 
higher intensity. This should bias the representative data accumulated from transposition 
product transformations to overlap with the subset of PCR products analyzed by denaturing 
gel analysis with the highest band intensities. In this way. both types of data are v alid for 
attempting to determine a preferred insertion site. 

20 The inventor's search for a common insertion site motif failed to uncover any 

preferred single nucleotides or groups of nucleotides that showed a higher incidence 
amongst the most intense signals on a denaturing gel or amongst the insertions isolated by 
transformation. Similarly, there were no apparent motifs common amongst the least 
preferred sites analyzed in the denaturing gel analysis. The lack of a sequence preference 

25 for insertions with this reaction is a v ery desirable result if it is to be employed as a highly 
nonspecific method for mutagenizing DNAs. 
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TnsC v:2rA : A Tool for /// vitro .Mutagenesis 

The impressive efficiency and low specificity o! the TnsABC " in vitro reaction 
makes the reaction an excellent tool for in vitro mutagenesis. The high efficiency of the 
reaction (i.e.. the high percentage conversion of donor substrate to double-ended simple 
5 insertions) is critical when considering how the recombinant DNAs will be recovered. The 
observ ation that the majority of the molecules resulting from a reaction that contain a 
junction between the donor DNA and the target DNA are double-ended simple insertions 
provides an advantage over alternative transposon-based insertional mutagenesis systems 
because large portions of the junctions seen in these reactions can be single-end joins 

10 (Rowland. S. J. et al. EM BO J. /V: 196-205 (1995)). This study has demonstrated that 
standard commercially av ailable E. coli competent cells are capable of repairing the 
characteristic gapped molecules formed as a result of a Tn7 simple insertion, provided the 
target DNA contains an origin capable of replicating in /:. coli. Thousands of isolates can 
be recovered from a single transposition reaction starting with sub-microgram quantities of 

15 donor and target DNA. I ligh efficiency cells should yield even greater numbers of isolates. 
Tn7 insertional mutants could be recovered from many different organisms as long as the 
target DNA carries information required to replicate in its respective host, the gaps can be 
repaired by the host, and DNAs can be reintroduced into the host with reasonable 
efficiency. 

20 Cosmid clones have been successfully mutageni/ed and recovered by the method 

just described. Pilot reactions were done using purified cosmid clones. But it would be 
very simple to mutageni/e an entire cosmid library and select for mutants by the same 
process. Replicons as large as 125 kb (a BAC. data not shown) have been successfully 
targeted and recovered. An earlier study of the inventor demonstrated that the ability of 

25 transposition machinery to recognize whether or not a potential target molecule already 
contains an insertion breaks down as the distance between two insertion sites increases 
(52). It has been shown that the degree to which a target molecule is "immune" to a second 
insertion has an inverse relationship to the length of separation of the sites of insertion. 
TnsC """ has demonstrated a sensitivity to immunity signals. 1 o date, the inventor has 
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seen very few examples of double insertions into plasmids in the 40-50 kh range, 
suggesting that this tool will be highly effective for mutageni/ing eosmids or plasmids in 
the 1-50 kb range. 



Example 3 
A Kit For Making Transposon Insertions 

10 

The kit provides transposon insertions into DNA in vitro. These insertions can be used to 
provide priming sites for DNA sequence determination, or to provide mutations suitable for 
genetic analysis, or both. 

15 Section A: Reaction Constituents 

Al) PROTHINS 

TnsA 30 |ig/ml in 10% glycerol 
TnsB 20 (.ig/ml in 25% glycerol 
20 TnsC| 2 7 100 |ig/'ml in 10% glycerol 

Proteins were kept at -70°C\ 

A2) BUFFER CONS n i l TINTS 



25 HHPHS 0.25 M pH 8.1 

Tris[Cl | 0.25 M pH 7.6 [can be omitted] 

BSA 10 mg ml 

tRNA 50 |.ig ml [can be omitted] 

DTT 1 M 
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ATP 



100 mM 



MuAcetate 



375 mM 



5 A3) TRANSPOSON DONOR PLASMID 

100 ng ml 

The essential features of the plasmid are deseribed above as containing the R6K conditional 
replicon. 

10 A4) CONTROL TARGET PLASMID 



This plasmid contains both pUc and Mi3 origins, a lacZ* MCS and amp. See the Figure 
legend for figure 10B. 



A5) SEQUENCING PRIMKRS 

NLC94 (SEQ ID NO: 13) 3 pmol 
NLC95 (SEQ ID N():7) 3 pmof jil 

20 Section B : (Can be supplied by user) 

B 1) LOR THE REAC TION in vitro 

water: Millicue or equivalent recommended 

Target DNA not carrying Kanamycin resistance (0.4 - 0.5 jig per reaction) 
Water bath or heat block. 30°C 
25 1 .5 ml microtubes or other vessel; one per reaction. 

B2) TOR STOPPING THE: REACTION 

when using chemically competent cells 

Water bath or heat block. 75°C. Note: not 65°C. 



pLITMUS28 



400 ng/ml 



(New England BioLabs, 32 Tozer Road, Beverly. Massachusetts, 01915) 
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when using electocompetent cells 

Distilled phenol equilibrated with TE or Tris pi I 8.0 
Chloroform equilibrated with IE or Tris pH 8.0 
EtOH for precipitation 
N a Acetate 3 M 

Water or 1 mM Tris pH 8 or TE 

B3) FOR RECOVERING INSKRTIONS: 
B3a) Transformable cells: 

Any standard E. coli strain can be used; we have used ER182K ER2502 and 
MCI 061 (New England BioEabs. 32 Tozer Road, Beverly, Massachusetts. 
01915). 

Any kanamycin-sensitive organism in which npt can be expressed can also be used 
with the KanR donor, including but not limited to. Salmonella, other enteric organisms. 
Haemophilus, Rhizobium, and Bacillus. With a suitably altered selectable marker on the 
transposon donor plasmid, any prokaryotic or eukaryotic organism into which exogenous 
DNA can be introduced, may be used to recover insertions. 

In this example, 

B3ai) Chemically competent ERI821 New England BioEabs. 32 Tozer Road. 
Beverly. Massachusetts. 01915 (2 x 10 transformants ug of LITMUS or similar plasmid) 
was used. A sample protocol for preparing these is provided below, section Dl 

In example 2 we show the use of 

B3aii) Electrocompetent MC1061 (ATCC" 53338) 

(7 x 10 ) transformants ug of pEH \U S-28 or similar plasmid). A sample 
protocol for preparing these is provided below, section 1)2 

60 
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Commercially available competent or eleetrocompetent cells may also he used. The 
method of determining competence of these preparations is provided below, section 
D3. 

5 

B3b) Outgrowth media: 

Rich Broth (I)4a below) or mSOC (D4d) without drug, or equivalent. 

0.4 ml per reaction; we recommend three reactions as a standard pilot 
experiment (see Section C below). 

10 

B3c) Selective media: 

Rich Agar with drug (D4b). or equivalent 

at least 1 plate per reaction; the standard pilot experiment described in section C 
require 6 plates, three with two drugs and three with one drug. 

Kanamycin is REQUIRED to select lor the transposon of the present example 
Ampicillin is used for the RECOMMENDED positive control. Carbenicillin can 
be substituted 

f or the example of Section C\ below, RB Kan Amp (3 plates) and RB Amp only 
20 (3 plates) are used. If the target plasmid carries some other drug resistance, the 

experimental reaction in the pilot experiment should be plated on Kanamycin 
plus that drug. 

B4) 1 OR DNA PREPARATION FOR SEQUENCING (see example 2): 

25 

Any standard procedure that ordinarily gives sequencing grade DNA. We have 
tested Qiagen spin columns and gravity How plasmid preparations. 
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Section C. Tn7 /'/; vitro transposition reaction protocol 

CI. Rl AC HON VOLl'MF = IOOliI 

C2. RI-COMMKNDKD PILOT HXPHRIMHNT 3 samples to bo carried through. 

5 

Tube 1 Hxperiinental (Target DNA. protein and donor plasmid added) 
Tube 2 Reaction positive control (pLIT\H T S28. protein and donor plasmid 
added) 

Tube 3 Reaction negative control (Target DNA added, no protein, donor added) 
10 Tube 2 is also used as a transformation positive control 

In this example, all tubes have pLITMUS28 as target (tubes 1 and 2 are duplicates). 
Tube 2 need not necessarily be included in every experiment. 

15 C3. MAKH I r P a mix using reagents of Section A: 

per reaction: 

(73.9 |il H : 0) - (volume of target DNA); in this example, target DNA is 1 |il 
20 10 |il Hepes(250mM pH 8.1) 

1 ^1 Tris(250mMpH7.6) 

0.5 ul BSA(lOmg'ml) 

2.1 ul tRNA (SOng'ml) 

0.2 ul DTT(IM) 
25 2 [i\ ATP(lOOmM) 

C4. DISPPNSf mix of step 3 to each tube (89.7^1) - (volume of target DNA) reaction; in 
this example, this is 88.7|il. 
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C5. ADD target DNA of section B (0.4(.iu) to tubes 1-3. In this example, this is 
pl.If Ml S2K. 1 ul. This works well tor plasmid targets. For cosmids. 0.5pg worked well 
when the eosmid was around 10 times the size of the donor (5.2kb) i.e. a molar ratio of 
around 2: 1 (donor to target). Increasing the ratio to 4:1 decreased the efficiency slightly. 

C6. ADD to each tube 



Tube 2 Tube 3 

ul 1.3 ul (40ng) 0 
ul 3 ul(20ng) 0 

ul 1 ul(100ng) 0 

0 5.3 ul 



Tube 1 

TnsA 1.3 
TnsB 3 
TnsCj27 1 
dl 120 0 

C7. ADD 1 ul donor DNA (O.lug pMCB40). 
times. 

T ube 1 Tube 2 
Donor pMCB40 I pi 1 ul 



Mix well by pipetting up and down a few 



Tube 3 
1 ul 



C8. INCT BATH 10 minutes at 30°C (assembly reaction) 

C c ). ADD 4 ul Mg.Vc (375mM) to each tube. Mix well by pipetting up and down a few 
times 

Tube 1 Tube 2 l ube 3 

MgAc 4 ul 4 ul 4 pi 



63 



K I A0I :iHSM~% 1 



\m nu \<> ;<"S'< :tis:< 




CIO. INCl BATH 1 hour 3()°C (transposition reaction) 

CI 1. HI-AT INACTIVATE 75°C 10 minutes. Note: 65°C is not adequate. 

5 CI 2. TRANSFORM using chemically competent cells (see procedure of section I)] ): 
a. Add 10 fil of the reaction mix to 100 ul competent cells thawed on ice. 
h. Incubate 1 h on ice. 

c. Heat at 37°C for 45 sec. 

d. Chill on ice 2 min. 

10 e. Dilute the transformation mix into 0.4 ml RB (total volume 0.5 ml). 

f. incubate 40 min at 37°C. 

g. plate 100 |il tubes 1-3 on Kanamycin-containing selective media. 

h. plate dilutions of tube 2 on medium selective for the target plasmid only: dilute 100 
fold (10 ^1/1 ml) and 1000-fold (1 |il/l ml) and plate 100 ul of undiluted and of each 

15 dilution (3 plates) 

In this example, selective medium was RB Kan (20 |ig/ml) Amp (100 pg/ml) (tubes 
1-3) and RB Amp (100 jag ml. tube 2). Competent cells were HR1821, chemically 
competent (Section Dl). 

20 

CM 3. Transformation result: 
On Kan Amp: 
Tube 1 285 colonies 
Tube 2 600 colonies 
25 l ube 3 0 colonies 

On .Amp only: 

Tube 2 confluent (undiluted) 
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Section I): Recipes and auxiliary procedures 

1)1) Chemically competent colls (A', coli): 

5 

a. Inoculate a single colony from an RB agar plate (see D4b) into 2 ml of RB (I)4a) 

in a plating tube. Shake overnight at 37°C. 

b. Subculture the overnight 1 : 1 00 in 1 Volume Unit of RB+20 mM MgS0 4 
10 (typically 250 ml). Grow to OD 5 w=0.4-0.6 or Klett=60 (-2-3 h). 

c. Centrifuge 5,000 rpm 5 min at 4°C. 

d. Gently resuspend pellet in 1/2.5 Volume Unit ice cold TFBI (see below, 
15 D4Q. Keep all steps on ice and chill all pipets, tubes, flasks, etc. from 

this point on. 

e. Incubate on ice for 5 min. 

20 f. Centrifuge 5.000 rpm 5 min 4°C 

g. Gently resuspend pellet in 1 25 original volume cold TFB2 (D4g). f or 250 ml 
of original subculture, use 10 ml TFB2. 

25 h. Incubate on ice 15-60 min. before aliquoting 100 pi tube for storage at 

-70°C. Quick-freeze the tubes. 

i. To transform, thaw an aliquot on ice; add DNA; incubate 1 h on ice; heat shock 
45 seconds at 37°C; incubate on ice 2 min; dilute 5-fold into RB with no drug (for 
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phenotypie expression); grow with vigorous aeration at 37°C tor 20 min.: plate on 
selective medium. 

This procedure works with most strains and should routinely give >1() ctu jig of 
pLITMl T S28 (using 0.1 ng/transtbrmation). Frozen cells last at least a year. 
1)2) Hectroeompetent cells (E. coli) 

D2a. Rationale and comments 

This procedure prepares cells for use in gene transfer employing an electroporator 
device such as that supplied by BioRad. DNA is introduced into cells by means of an 
electric field. 

Successful electroporation requires a low electrolyte concentration, to avoid arcing (and 
cell killing) in the device. Cells are grown to midexponential phase, washed extensively in 
distilled water and sterile 10% glycerol, concentrated 500-fold in glycerol, aliquoted and 
stored at -70°C. 

Any strain can be used for this purpose, although some strains are said to give larger 
numbers of transformants. Resuspended cells should be well-dispersed for best results. 
Some strains resuspend more evenly in the low electrolyte solutions; some lyse under these 
conditions with rough treatment. 

The electroporation procedure itself involves transfer of the thawed cells to an 
electroporation cuvette (which has leads that contact the device appropriately), addition of 
DNA. imposition of the electric field, recovery from this treatment (by incubation in broth), 
and plating selectively. 

Efficiency of transformation with this method is 100-500 fold greater than with 
standard transformation. It is therefore especially suitable when low transformation 
efficiency is expected or large numbers of transformants are desired. The method is said to 
be especially suitable for introduction of large DNA molecules. 
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D2b. Preparation of electrocompetent E. coli cells (from BioRad recommended procedure ) 
i. Materials for 2 ml of electrocompetent cells (20 aliquots, 100 



overnight culture of desired strain 1 ml 
5 (in Rich Broth (D4a) or Luria Broth (D4c)) 

Luria Broth (I)4c) 1 L 

dILO. sterile, 4°C or 0°C 1.5 L 

10% (vv/v) glycerol, sterile (D4h) 22 ml 

1 L sidearm flasks 2 

10 250 ml centrifuge bottles 6 

50 ml Oak Ridge centrifuge tubes 2 

1.5 ml microtubes, polypropylene 20 

Pipet tips (sterile) for P200 or equivalent 20 

Sterile glass or plastic pipets, 25 ml 3 
15 Klett-Summerson colorimeter 

High speed centrifuge (e.g. Beckman J21 ) 
Micropipetter. e.g. Gilson Pipetman P200 



Water bath rack that can be used to immerse tubes in liquid nitrogen. 
Liquid nitrogen bath for quick freezing 

20 

ii. Procedure for making electrocompetent cells 

Be sure the sterile dl LO and 10° o glycerol is cold. 
If necessary, distribute the Luria Broth to sidearm flasks, 500 ml- flask 
25 Inoculate each flask with 0.5 ml of the overnight culture 

Incubate with shaking until Klett= c >() ( 5 x 1() S cfu ml). Quick conversion if Klett is not 
available: 1 OI) 1 50 Klett Units; 10*' cells 1.1 OD) 
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Chili on ice with swirling, until cold. It is very important to keep everything cold 
from this point on. 

Transfer to centrifuge bottles, 167 ml bottle or as desired. 

Centrifuge 4.000 rpm 15 min 5°C in JAM rotor in Beckman. Decant supernatant. 
5 Resuspend gently in equal volume (II. total) cold sterile distilled water. Keep in an ice 

bath while resuspending. Repeated pipetting will help; chill pipets for this use. 
MCI 061 cells (FR1709) can be kept on ice at this stage for at least an hour 
Centrifuge 4,000 rpm 15 min 5°C in JA14 rotor in Beckman; decant supernatant. 
Resuspend gently in 1/2 volume cold sterile distilled water (0.5 L total). Keep in an ice 
10 bath w hile resuspending. Cells can now be combined into three bottles if desired. 

Centrifuge 4,000 rpm 15 min 5°C in JA14 rotor in Beckman. Decant supernatant. 
Resuspend in 1 /50th volume cold sterile 10% glycerol (20 ml total). Keep cold while 
resuspending. 

Transfer entire amount to a 50 ml Oak Ridge tube (35 ml capacity ). 
15 Centrifuge 4,000 rpm 15 min 5°C in JA17 rotor in Beckman, with balance tube. Decant 

supernatant 

Resuspend in 1 /500th volume (2 ml total) cold 10% glycerol. Keep cold. 

Distribute 100 (il/tube to microtubes in ice water bath rack; immerse rack in liquid NV 

transfer to box; store at -70°C 

20 

D2c. Procedure for clectroporation of poratable /:. colt cells (from BioRad recommended 
procedure ) 



D2ci. Materials (per clectroporation reaction) 

Hlectrocompetent cells 100 ul 

18x1 50 mm culture tubes 1 

Idectroporation cuvettes (BioRad cat" 1652086 or equivalent) 1 

mSOC (see l)4d) 1 ml 

Pasteur pipets. sterile 1 

68 
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DNA to be transformed: in low ionic strength medium, e.g. dH : () or TH 
(see D4i). 

Hlectroporator (BioRad Gene Pulser or equivalent) 
Ice bath trays lor cuvettes and outgrowth tubes 
5 Rollordrum in 37°C incubator or other means of incubating culture tubes 

Selective agar plates and plating materials 
37°C or suitable temperature incubator 

D2cii. Procedure 

10 Be sure all materials are set up ready to go before getting cells out of the freezer. The 

DNA must be added and the electorporation done as soon as the cells are thawed; cells will 
lyse after a short time, resulting in arcing as the medium becomes more conductive. 

Chill cuvettes and hold on ice (>5 min). Transformation efficiency declines at least 

100-fold if cuvettes are at room temperature 

15 

Set BioRad Gene Pulser to 25 jaF capacitance. 2.5 kV. and the pulse controller to 200 LI 

(maximum voltage) 
Thaw electrocompetent cells at room temperature and transfer to ice. 
In a cuvette mix 40 |itl cells with 0.4 pg-0.3 |ig DNA. Shake the suspension to the 
20 bottom of the cuvette, rap on table to shake loose air bubbles. 

Place the cuvette in the holder 

Apply one pulse by pushing both red buttons until a beep is heard. This will result in a 

pulse of 125 kV cm with a time constant of 4-5 sec. 
Immediately add 1 ml mSOC to the cuvette and gently but quickly resuspend the cells. 
25 A PI 000 with sterile blue tips or sterile pasteur pipets can be used for this. A 1 min 

delay in adding the medium results in 3 fold decrease in transformation efficiency. 
Transfer cells to culture tube. 
Incubate 37°C 1 hour 
Plate on selective media. 



r r.v> i :ms5P\ t 



69 



\n\ Dki No ?5^xu Mis:> «>"s^-? \ 




D3) Standardi/ation of transformation or electroporation 
D3a. Rationale and comments 

5 

To ensure that gene transfer is successful, we recommend that the cells prepared above 
(Dl or D2) or purchased commercially be transformed with a standard DNA dilution 
series before experimental use. Below is an example of such a standardization for 
electrocompetent cells (D2). Chemically competent cells will yield 100-500 fold fewer 
10 transformants, so dilutions given below should be appropriately adjusted. 



D3b. Materials for a standardization experiment 

15 Dilutions of standard DNA. usually a high-copy small plasmid (e.g. LITMUS28). in 

TL:: 

A 1 ng/|il 

B 10 pg/|il 

20 C 1 pg/|il 

D 100 fgjd 

Selective agar plates; RB 1.5% Amp 100 ng ml tor pUTMUS28 12 

Dilution medium, usually 0.85% saline 7ml 

25 Dilution tubes, usually 13 x 100 mm 7 

Sterile plastic or glass pipets. 0. 1 ml 1 0 

Sterile plastic or glass pipets. 0.2 ml 1 

Sterile plastic or glass pipets. 1 ml 1 
Micropipetters. e.g. P200 and P20 or 1M0. for DNA transfer and dilution 
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scries 

Pipet tips tor P200 and P20 or P10 
Spreader 

Hthanol or isopropanol for flaming the spreader 
5 37°C incubator 



D3c. Procedure for standardization experiment 

D3ei. Set up dilution tubes below and label plates beforehand or while cultures are 
growing out. 

10 D3cii. Carry 1 out electroporation as above (D2) with DNA dilutions A-D 

D3ciii. Place cultures on ice to prevent further growth while making dilutions and 

plating as below. 
D3civ. Dilute in saline: 



Sample A 10' 1 . 10 \ 10" 4 

Sample B 10" 1 . 

SampleC 10" 1 

Sample D no dilutions 



20 

This can be carried out as: 

10' 1 dilution: 100 ^1 sample -r 900 ul saline 
10" : dilution: 10 (.il sample ^ 1 ml saline 
10" dilution: 10 (.il 10" 1 dilution 1 ml saline 
25 10" 4 dilution: 10 \i\ \(Y 2 dilution a 1 ml saline. 
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I)3cv. Plate on selective media by spreading; flame the spreader after each plate: 



Dilutions: undiluted 
Samples: 



A 
B 
C 
D 



10 



0.1 ml 
0.1 ml 
0.1 ml 
0.2 ml 
0.5 ml 



10"' 

0.1 ml 
0.1 ml 
0.1 ml 



1 0" 



10" 



0.1 ml 0.1 ml 0.1 ml 

0.1 ml 



D3vi. Hxample of result: 



Dilution/vol 



Transformants 



Sample DNA added 



plated Colonies per ml per ug 



A 



1 ng 



1/0.1 Confluent 

2/0. 1 very numerous 

3/0.1 -1000 

4 0.1 71 7x10" 7x10" 



B 



lOpg 



0 0. 1 very numerous 

1 0. 1 405 

2 0.1 4<) 4x10' 4x10" 

0 0. 1 very numerous 

1 0.1 106 1 x 10 4 



1.1 x 10 



in 



I) 



1 00 tii 



()■'(). I 5()() 
0 0.2 173 
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0 0.5 75 



8x 10 : 



Average transfbrmants ug 7.6 \ Hf 

1)4) Recipes 
Bacteriological 

I)4a)RB. per liter 



Tryptone (Difco) 10 g 

Yeast Extract (Difco) 5 g 

NaCl 5 g 

NaOH(l N) 2 ml 
Autoclave 

D4b) RB Agar with drug, per liter 

Tryptone (Difco) 10 g 

Yeast Hxtraet (Difco) 5 g 

NaCl (Baker) 5 g 

NaOH (1 N) 2 ml 

Agar (Difco) 15 g 
Autoclave 



Drugs: add after autoclaving and cooling to 55°C\ per liter: 
kanamycin(RHQWIRHD) 20 mg 

Other drugs that MAY be added, per liter; choice depends on target plasmid 
Ampicillin or carbenicillin 100 mg 
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Chloramphenicol 
Tetracycline 



1 5 nig 
1 5 mg 



Others drugs not tested but presumably usable in an appropriate host strain: 
Spectinomycin 
Streptomycin 
Gentamycin 
Hrythromycin 

Rifampicin (recessive marker) 
Bleomycin 

Other antibacterial small molecules 
D4c) Luria Broth, per liter 



Tryptone 1 0 g 

Yeast extract 5 g 

NaCl 10 g 

MgC12-6H : () 1 g 

glucose 1 g 



Aliquot and autoclave. For preparing electrocompetent cells (C2) it is convenient to 
aliquot 500 ml flask in 1 L sidearm flasks before autoclaving. 

D4d) mSOC. per liter (modified from BioRad recipe) 
Luria Broth 1 L 

MgS() 4 . 1M sterile 10 ml 

40°b glucose, sterile 6.5 ml 

Add MgS() 4 and glucose sterileh to sterile Luria Broth 
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I)4e) 0.85% saline, per liter 

NaC'l S.5 g 

Distribute in suitable aliquots. autoclave. 

s 

Buffers and storage media 

D4f). TI-BI 

30 mM KOAc (potassium acetate) 
10 lOOmMRbCl 
10mMCaCl 2 
50 mM MnCl 2 
1 5% glycerol 

15 Adjust to pH 5.8 with acetic acid and filter to sterili/e. It is convenient to make this 

as: 

5 g RbCl (Alfa) 
12.3 ml KOAc 1 M 
4.1mlCaCl ; 1X1 
20 20.5 ml MnCN 1 M (this is pink) 

61.5 g glycerol; pH to 5.8 with < 8 ml HOAc 0.1 M 

make up to 410 ml: distribute in 100 ml sterile aliquots: and use 1 aliquot 250 ml 
culture. 

25 

I)4g). TFBII 

10 mM MOPS 
75 mM CaC'l: 
10 mM RbCl 
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1 5% glycerol 



Adjust pH to 6.5 with KOI I and filter to sterili/e 
Make up as 

I. 5 ml MOPS 1 M pll 6.5 (this is yellow) 

II. 25 ml CaCl : 1 M 
1.5 ml RbCl 1 M 
22.5 g glycerol 

pH with 1 N KOH; make to 150 ml, filter; use 10 ml per original 250 ml culture 

D4h) 10% glycerol, per liter 

Glycerol 100 g 

dll : 0 1 L 

Aliquot and autoclave 

I)4i) TK. per liter 
1 M Iris pll 8.0 
0.5 M KDTA pH 8.0 

Example 4 



10 ml 
2 ml 



Random Insertion of Primers For Sequencing 



Section A: Components used for transposition reaction 

Al) PROTHINS 



TnsA 40 fig ml in 10° o glycerol 
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TnsB 20 ug'ml in 50° b glycerol 
TnsC'i:- 100 ug ml in 50" o glycerol 



Stored at -70"C 



A2) BITITRC ONSI 111 T\ IS 

HHPHS 0.25 M pH 8.1 

Tris[Cl] 0.25 M pH 7.6 [can be omitted] 

BSA lOmg/ml 
10 tRNA 50 ng/ml 

DTT 1 M 

ATP pH7 100 mM 

MgAcetate 375 mM 

TnsD storage buffer TnsD is stored in the following buffer: 3.3ul 
is 500 mM KC1. 50 mM Tris-HCl (pH 8.0). 

1 mM EDTA. 2 mM D I T and 25% glycerol 

A3) TRANSPOSON DONOR PI.ASMID 

20 pHM delta R.adj to 1 50 ug'ml 

(Sequence appears in Figure 9B and SHQ ID NO:3)) 

A4) TARGIT PLASMID 

1 ) pi R 1 83 mini-cleared lysate 200 ug ml 

25 2) pi Rl 83 C'sCl preparation 400 ug ml 

3) P RM2 4()0ugml 
(Sequence of pHR183 appears in figure 10A and SHQ ID NO:5) 
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Section H: Components used for processing reaction 



Phenol chloroform equilibrated with TH 
Phenol equilibrated with I ris pll 8.0 
5 NaAcetate 3 M 

lithanol (HlOH) 

list Ell New Kngland BioLabs. 32 Tozer Road. Beverly. Massachusetts. 01915 
DNA Polymerase I Holoenzyme New England BioLabs. 32 Tozer Road. Beverly. 
Massachusetts. 01915 

10 14 DNA Ligase New Kngland BioLabs. 32 Tozer Road. Beverly, Massachusetts, 01915 

1 OX Fi/L buffer (section 13) 

10X Buffer 3 (NKB#007-3) New England BioLabs. 32 Tozer Road. Beverly. 
Massachusetts, 01915 
tRNA 1 mg/ml 
15 DNA buffer (section 12) 

TL (section 14) 



Section C: Components used for recovery of insertions 



20 MCI 061 electrocompetent cells (made and used as in Kxample 3. D2 and D3) 
Selective media (made and used as in Kxample 3, D3 and D4) 



Section 1): Components used for sequence determination 

25 

dm si on nunc; primlrs 

NLC94 3.2 pmobjil. 

Sequence of this primer (SHQ ID NO: 13): 
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5' AAACrrCCAG I A I CiC T I I r TCACACiCA I AAC 

NLC95 3.2 pmol ul 

Sequence of this primer (SHQ ID N():7) 
5 5' A I AATCC I FAAAAAC' rCT , ATfrCX 1 ACCCC I' 

D2) QIAprep spin miniprfp kit (Qiagen Cat # 27106) 

D3) ABI Sequencer (info) and reagents 

10 

Section E: in vitro transposition protocol 

HI) MAKE UP Mix: 



208.2 


Ml 


dH 2 0 


30 


Ml 


Hopes (250 mM pi I 8.1) 


-1 
.1 


Ml 


I ris (250 mM pH 7.6) 


1.5 


Ml 


liS A (10 mg/ml) 


6.3 


Ml 


tRNA (50pg/ml) 


0.6 


Ml 


DTT(IM) 


6 


Ml 


ATP(lOOmM) 



20 



1-2) DISPHNSK 85.2 pi to three tubes 
1:3) ADD target DNA of A4. 2 ul 
1:4) ADD to each tube 





lube 1 


Tube 2 


Tube 3 




TnsA 


1.3 pi 


1.3 pi 


1.3 M l 




TnsB 


1 pi 


1 Ml 


1 M l 




I'nsCr 


1 pi 


1 Ml 


1 ul 




D buffer 








A 


Donor 


-> 
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1:5) INCCBA IT 30 minutes at 30°C (assembly reaction) 
1:6) ADD 4 |il MgAc (375mM) to each tube. 

5 

H7) INCUBATH 1 hour at 30°C (insertion) 

Section F: Reaction processing 

10 In this example, the transposon donor was capable of replicating in the host used for 

recover)' of insertions. Transformation of the reaction mixture on plates selecting for the 
transposon and the target markers might well result in many colonies with two different 
plasmids, rather than with a single plasmid containing both markers. For this reason, we 
digested the reaction with a restriction endonuclease cleaving in the donor replicon but not 

15 within the transposon or in the target DNA. In addition, we examined the consequences of 
repairing the strands not ligated by the transposition reaction, using DNA polymerase I 
holoenzyme and ligase. 

Per reaction (100 \xl): 



20 



PC extract: 



Add 100 phenol chloroform, vortex 



Centrifime 5' in micro fuee 



Backextract 



Remov e organic phase to a new tube with 100 |il TK: vortex 



Centrifime 5' in microfime 



Combine aqueous phases ( 185 ul total) 
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HtOH precipitate 

20 ul 3 M NaAe 
500 ^1 HtOH 



5 chill on dry ice 

Centrifuge 5 min in micro time 
Drain supernatant, air dry 
Resuspend in 100 (.il DNA buffer 



10 



Divide each reaction for further treatment (all volumes are 

Treatment: Repair Diget 

Digestion 

15 A B 

1 ) Repair/1 igation 

DNA 40 40 

IOXFi/L 5 
dll : 0 2 
:o Pol I (10.000 'ml ) 2 

a) Incubate 15 min room temperature 

Ligase (400.000 ami) 1 

25 

b) Incubate 4 h 16°C 
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2) Digestion 
1 M Nat l 
10 X butter 3 
listHII ( 10. 0(K) u ml) 



6.0 
1 



Incubate 6()°C 1 h 



10 3) Protein removal, buffer exchange 1 
Phenol, equilibrated 



50 



50 



15 



a) Mix. centrifuge 5' in microfuge 

b) Back extract organic phase with DNA buffer 
e) Combine aqueous phase 



Total volume, step 3c 100 

3 M NaAc 1 0 

tRNA 1 mg/ml 1 

20 HtOH 120 

a) Incubate 5 min room temperature 

b) Centrifuge, discard supernatant 

c) Wash twice with cold 70% KtOI 1 (100 ul) 



100 
10 

1 

120 



DNA buffer 

d) Resuspend 

final volume, step 3d 



50 



50 



50 



50 



K I \H! 2luX5l"vl 
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4) [Butter exchange 2 
Re-precipitation 

DNA from step 3d 35 35 

3 M NaAc 5 5 

KtOH 137.5 137.5 



a) Incubate -7()°C overnight 

b) Centrifuge, discard supernatant 
10 c) wash twice 200 (il 70% KtOH 

TK 50 50 

d) Resuspend 

15 Section G: Recovery of insertions 

Hlectroporated 10 jliI of samples into MCI 061 following procedure of Kxample 3. section 
1)3 
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0 


0 




Table 2 Sample codes, treatments, and target concentrations corrected for losses durin 


manipulation 






1 arget 1 reatment 


i argei selection 


[ i argei i ji\ .* v | 


Name 




\ 1 1 1 iui yii ) 


1 A pi ,ix 1 oJ I 1 L. l>Mg 


C am 




IB pKR183 Digested 


Cam 


0.05 


2 A pLR183 Fi L. Dig 


Cam 


0.98 


2B pHR183 Digested 


Cam 


0.42 


3 A pRM2 Fi/L. Dig 


Amp 


0.66 


3B pRM2 Digested 


Amp 


0.56 


Table V Colony forming units ner ml on 

1 C-l L'l v *J » V_ \J IV/ll t 1 V. ' 1 111111 {w, 111 I 4. J l_/ W 1 1111 \~/ I 1 


appropriate selective plates 




Sample l.\ IB 


2A 2B 


3 A 3B 


Donor (or recomb) 






Kan ()nl\ 1 30 1.8 x Mf 


5x10' 7\10" 


3 . 7 x 1 0 4 4 x 1 0 4 


Recipient 






Cam onl\ 1 \ 10 4 8 \ Hf 


4 x io 4 ft \ i<r 




Amp onlv 




3x10" 4x10" 


Colonies thiol ft \ 1 0 4 1.6 \ 10° 


4 x 10 ; 1,1 x 10' 


4.5 x Hf 7 x Uf 


Keeomhinani 






Kan Cam lo 2." \ 1() ; 


880 1 \ lo 1 




Kan Amp 




11 \ 10 s 4 \ IO 1 


Kecomh reeip I \ 10 ' 3 \ 10 ; 


2 x 10 : 1 x 10 1 


4 x 10 ; 1 v 10 


75 recombinant colonies were chosen. 31 


from samples 2A. 44 from s 


amples 2R. for 


further characterization 
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H. Determination of sequence location. 

I . Procedure sum man 

75 recombinant colonies were picked into 0.5 ml KB in racked array for storage. 
Subcultures of these storage cultures were grown with selection (RB Cam Kan), and 
5 minipreps made according to the directions of the manufacturer for large plasmids 

of low copy number. 



DNA concentration of the plasmid preps w as determined by comparison w ith a 
dilution series of linearized pLITMUS28 on agarose gels. Plasmid preps were 
10 linearized for this purpose with an enzyme that cleaves once in the target nlasmic 

and not in the transposon (SacII). 



Primers NLC94 (SEQ ID NO: 13) and NLC95 (SEQ ID NO:7) were used for 
sequence determination, using flourescently-labeled didcoxynuclcotidc sequencing 
15 reagents from Applied Biosystems. 



Sequences were run on an ABI sequencer, and sequence acquisition, editing and 
assembly was carried out with the supplied programs (SFQFD, FACTURA and 
AUTOASSHMBLK). 

Output is Fig. 1 1 



2. Results 



25 a. fable 4: Summary result of 75 recombinants (CamR KanR colonies), potential Tn7 

insertions into pi- Rl 83. 



Total DNA preps 75 
DNA concentration too low to attempt sequence: 7 
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Transformant contained two plasmids. not sequenced: 1 
Total not sequenced 8 

I)\A preps sequence attempted 67 

5 Sequence unreadable (miscellaneous reasons) 2 

Sequence unreadable because 2 insertions in one plasmid 1 

Total sequence unreadable 3 

DNA preps sequence obtained 64 
10 Sequence rejected (cross contamination of adjacent wells) 1 

Total insertions rejected 1 
Independent insertions for which location was obtained 63 
Number of insertion locations 62 
Number inserted clockwise 33 
15 Number inserted counterclockwise 30 

Aberrant insertions 

Number of insertion plasmids with structural aberrations 1 

This was a deletion far from the insertion 
Number of structural aberrations associated w ith insertion site 0 
20 Number of insertions with disagreement in 5 bp duplication 2 

These were: 



G-->A transition mutation in one copy with respect to target plasmid sequence 
G— > T transversion mutation in one copy with respect to target plasmid 
sequence. 

b. Analysis of the distribution of insertions among sequences and intervals. 

For the purpose of obtaining maximum sequence from an unknow n target, it is 
desirable that the insertions be distributed as randomly as possible with respect to 
regions of sequence and with respect to specific sequences. The summary of Table 4 
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already suggests a very random process, since 63 independent insertions hit 62 different 
locations, i.e. no hotspots for insertion were identified. [ ; or comparison, relaxed- 
specifieity derivative of Tn 10 (A TS2. examined with in vivo insertions into the lac 
operon) hit 23 sites with 50 insertions. 

5 

Primary data for further analysis below is found in Table 5. which gives the location of 
all the insertions, their orientation with respect to the target plasmid. and the sequence 
immediately adjacent to the insertion (the five bp sequence duplicated by the insertion 
mechanism) in a uniform frame of reference. 

10 

Table 5: Insertion locations and associated 5 bp duplication 



Directions Sequence at position orient- 

relative to Tn7R ation 

Isolate Sequence Location Insert 12 3 4 5 (Tn7 R 

obtained name clock- 

wise 
• ) 



1 


1 


6464 


A 5 


A 


G 


c 


r 


C 


2 


"> 


8428 


A6 


C 


T 


G 


G 


7 


3 


1 


8349 


A7 


C 


C 


T 


G 


A 


4 


"» 


5515 


A8 


T 


A 


A 


C 


r 


5 


™i 


7822 


A9 


C 


C 


C 


Ci 


c 


6 


*> 


365 


A10 


T 


C 


A 


A 


c 


7 


~> 


56^5 


Al 1 


I 


C 


A 


r 


G 


9 


"i 


2500 


Bl 


G 


G 


A 


T 


G 


10 


1 


8286 


B2 


e 


T 


I 


C 


r 


1 1 


1 


2764 


B3 


c 


r 


r 


7 


A 


12 




6953 


B4 


c 


G 


A 


G 


G 


13 




3414 


B5 


c 


7 


i 


1 


g 


14 


1 


3 139 


B6 


T 


C 


G 


I 


I 


15 


1 


3208 


B7 


G 


c 


A 


C 


i 


16 


■*> 


4208 


B8 


A 


G 


A 


G 


G 


17 


-» 


367 1 


B9 


G 


r 


T 


T 


A 


18 


■> 


5563 


B10 


C 


c 


A 


A 


G 


19 


1 


3539 


Bl 1 


G 


c 


I 


T 


G 


20 




3803 


BI2 


A 


7 


1 


C 


G 
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21 


- 


8474 


C! 


c 


C 


G 


c 


c 


--■ 




566 1 


C 2 


A 


r 


i. i 


A 


I 

1 


2 3 


- 


"'603 


C > 


C 


C i 


C 


( i 


1 


24 


- 


3205 


C4 


I 


c 


1 


1 


c 


2 ^ 


- 


1 650 


C5 


c 


c 


1 


\ 


1 


26 


- 


8020 


C6 


G 


c 


C 


1 1 


G 


27 


- 


2566 


C7 


A 


I 


I 


1 


I 


20 


- 


22 7 > 


CO 


Ci 


(J 


c 


C 


A 


30 


1 


6368 


CIO 


G 


c 


1 


\ 


T 


32 




2620 


C 12 


I 


\ 


1 


\ 


C 


3 3 


- 


5088 


D 1 


G 


G 


c 


Cj 


A 


34 




3400 


D2 


A 


1 


G 


I 


A 


35 


~) 


3033 


1)3 


I 


I 


G 


A 


T 


36 


2 


60 7 7 


D4 


G 


r 


I 


G 


T 


37 


~) 


6756 


D5 


T 


r 






V J 


38 


■*) 


5563 


D6 


G 


r 


T 


G 


G 


38 


2 


8224 


D7 


G 


Cj 


A 


G 


(3 


40 


*) 


3123 


D8 


C 


A 


A 


A 


T 


41 


1 


2746 


DO 


A 


A 


A 


A 


C 


42 


1 


1646 


D10 


C 


G 


A 


G 


A 


43 


1 


5678 


Dl 1 


A 


I 


G 


r 


Cj 


44 


i 


7406 


D12 


T 


(.; 


c 


A 


T 


45 




1744 


Rl 


G 


c 


c 


A 


T 


46 


- 


3584 


L2 


I 


A 


G 


G 


T 


47 




2112 


K3 


C 


C 


r 


A 


C 


48 




4205 


1:4 


G 


c 


A 


G 


c 


40 


1 


2708 


K5 


G 


c 


G 


G 


T 


50 


~> 


7828 


1-6 


/\ 


c 


A 


G 


A 


52 




3873 


K8 


A 


G 


T 


C 


I 


53 


- 


350 1 


[-0 


C 


A 


T 


G 


C 


56 


- 


5550 


[•12 


A 


T 


C 


G 


C 


57 


- 


2702 


n 


r 


T 


c 


A 


C 


61 


- 


4400 


K5 


G 


T 


T 


A 


A 


62 


- 


581 1 


16 


A 


C 


G 


C 


G 


63 


- 


2024 


\— 


A 


c 


T 


Ci 


T 


64 


- 


1470 


I'S 


A 


I 


C 


G 


T 
1 


66 


- 


5675 


1 1 0 


T 


1 


1 


A 


I 
1 


67 


- 


5208 


1- 1 1 


A 


T 


A 


A 


A 


68 




6020 


112 


Ci 


G 


r 


A 


A 


60 




6264 


C.l 


G 


A 


G 


T 


A 


70 


■> 


3881 


C,2 


A 


T 


T 


r 


G 


72 


-> 


2 SO 1 


C.4 


A 


T 


r 


c 


G 


74 


-> 


1 68 1 


(,6 


A 


C 


T 


c 


T 


76 


■> 


5315 


cr 


A 


A 


r 


A 


C 



S8 
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Table 5 legend : 

Isolate: Number of the colony 

Directions sequenced: 1 " only one direction from the insertion: 2^ both directions 
5 Position: coordinate on pER183 (SEQ ID N():5) top strand of the first base of the 5 bp 

duplication 

Insert name: accession number in notebook 

Sequence at position ~: position 1 is the base immediately adjacent to Tn7R top strand 
(i.e. it can be either the top or the bottom strand of pERl 83 (SEQ ID NO:5)): 
10 position #2 is the next but one to Tn7R; and so forth. 

Orientation, of the insertion relative to the top strand of pER183 (SEQ ID NO:5). +, 
Tn7R is to the right of Tn7L when displayed on the top strand of pER 183 (SEQ ID 
NO:5). Tn7R is to the left of Tn7L. 

15 1 . Distribution of insertions fits the Poisson distribution 

1. These insertions are randomly distributed as judged by the fit of the interval 
distribution to the distribution predicted by a Poisson process. 

20 The Poisson distribution gives the probability of observing exactly X, events 

(insertions) in a unit (interval) when the average number of events per unit is [\ (from Zar. 
J.I I. Biostatisiical Analysis Prentice-Hall. Englewood Cliffs. NJ 1974 p. 301 ). 

hX. „ V ," 

25 cq 1 

Xj! 
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Where 

exactly Xj insertions per interval 

u = average number of insertions per interval 



Let 

Xj= number of insertions in a 100 bp interval 

fj= Observed number of 100 bp intervals with Xj insertions/interval 
11 - number of 100 bp intervals in the set (=73) 
10 n =1 fj Xj/I fj = 63/73 



P (xi) = probability of finding Xj insertions in a 100 bp interval (from the Poisson 
distribution., eq 1 ) 

F, = P (x i, n = expected number of intervals with i insertions. 

15 

From the data in Table 5 and eq 1 we can construct the following comparison of 
expected and observed data: 
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able 6. Observed and expected distribution of insertions in 100 bp interv als 



10 



Insertions 
per interval 



4 



Observed 
interv als with 
X\ insertions 

f, 

34 
24 
9 



Probability of 
X] insertions per 
interval 

P(X,) 

0.42189 

0.35410 

0.157! ! 

0.04520 

0.00975 



Fxpeeted 
number of 
intervals with 
Xj insertions 

I ; , 

30.80 
26.58 
11.47 
3.299 
0.712 



These distributions are illustrated in Figure 12, where 11= observed distribution. Fi 
expected distribution. The lit looks good to the eye. 

b. Statistical test of fit between observed and expected distributions 



To test whether the observed and expected distribution are statistically 
indistinguishable, we used a Chi-square test for goodness of fit (from Zar, J. 11. 
Biostatistical Analysis Prentice-Mall. Englewood Cliffs, XJ 1974 p. 303). For this 
purpose we pool the tail of the distribution so that no expected number is less than 4. 
Rewriting Table 6. we obtain 
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10 



20 



Table 7. Chi-square test of goodness of fit to a random distribution 
Insertions per Observed intervals Hxpeeted number 

with X| insertions 



(f, 



Interval 

X, 

0 



t; 

34 
24 

9 
6 



of intervals with X, 
insertions 
F, 

30.80 

26.58 

11.47 

4.1541 

Sum 



Chi-square 

0.3329 

0.2504 

0.5315 

0.8209 

1 Q44 



The null hypothesis is that the observed distribution was drawn from a Poisson 
distributed population. For two degrees of freedom this sum of chi-square values gives 
a probability that this is the case of 0.25<p<0.5. The null hypothesis is not rejected. 

15 In sum. the eye (part a. Fig. 12) and a statistical test (1 able 7 and following) agree that 

the distribution of insertions in intervals along the DMA is random. 

ii. Analysis of the base composition of insertion sites. 



Site preference of Tns \BC12~for insertion of miniTn7 into pER183 



Certain bases are preferred at some positions in the five-base insertion site duplication, as 
shown in a histogram of base incidence versus position in the site (Figure 13). taken from 
the data in Table 5. In collating the data tor this histogram, the five duplicated bases were 
assigned position numbers relative to Tn7R; position one is the base immediately adjacent 
to Tn7R when the sequence is displayed with Tn7R on the right and Tn7L on the left. The 
orientation of the transposon relative to the target sequence during target choice is thus 
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controlled for: the target is displayed in the same way relative to the transposon for all 
insertion sites. 



A model for a preferred site was formulated: NYTRN. The elements of this site w ere tested 
5 for statistical significance individually and collectively by chi-square analysis (Table 8). 
The null hypothesis was that sites were draw n randomly from the universe of sequence 
defined by the sequence of pHR183 (SKQ ID NO. 5) after deleting sequence subject to 
selection (bp 1-250 and 2481-2509. CamR: and 581-1400. replication origin). Expected 
frequencies of the four bases, of purines and pyrimidines. and of trinucleotides w ere 
10 derived from frequencies obtained for pERl 83 (SEQ ID NO:5)-condensed by the GCG 
program COMPOSITION. 

Table 8. Chi-square tests (tests that differ from random expectation (p<0.05) in bold) 



15 f our bases individually, all sites collectively (315 bp experimental, 7410 bp control) 
Base Expected Observed Chisquare probability 

A 78.4 73 .372 

C 74.3 77 .981 

G 76.2 72 .232 

20 T 85.7 93 .622 

2.21 ().5<p<0.75 



Four bases individually, each position individually (63 bp experimental. 7810 bp control) 
Position 1 

A 15.7 19 0.694 

C 14.9 15 0.00066 

(i 15.2 17 0.213 

I 17.1 12 1.52 

2.43 0.25- p< 0. 5 
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Position 2 

A 

C 

G 



Position 3 

A 

C 

G 



Position 4 

A 

C 

(i 



Position 5 

A 

C 

(i 

T 



15.7 
14.9 
15.2 
17.1 



15.7 
14.9 
15.2 
17.1 



15.7 
14.9 
15.2 
17.1 



15.7 
14.9 
15.2 
17.1 



1 1 



15 
11 

12 

25 



19 



20 



IS 



3.8 

3.38 

1.16 

1.4 

9.74 

0.031 

1.02 

0.674 

3.65 

5.37 

0.(>93 

1.02 

1.52 

0.983 

4.21 

0.872 
0.645 
0.674 
0.889 
3.08 



0.01<p<0.025 



0.1<p<0.25 



0.1<p<0.25 



().25-p<0.5 



K I MM ;inX5P\ I 



94 



\it\ Dki \.> ;rs" :-nx:< , ^xo..-, .\> 



Purines and P\rimidines. each position individually (63 hp experimental. 
7410 hp control) 

Hxpected Observed 



10 



Base 
Position 1 
R 
Y 

Position 2 

R 

Y 



Position 3 
R 

15 Y 

Position 4 

R 

Y 



20 



Position 5 

R 

Y 



30.9 



30.9 
32.1 



30.9 



30.9 



?0.9 



25 T or not- 1 . position 3 
T 17.16 
not-T 45.84 



36 
27 



19 
44 



27 
36 



39 
24 



24 

39 



38 



Chisquare probability 



0.842 
0.810 
1 .65 

4.58 
4.41 

8.99 

0.49 

0.422 

0.914 

2.12 
2.04 
4.17 

1.54 
1.48 
3.10 

3.58 
1.34 
4.92 



0.1<p<0.25 



0.001<p<0.005 



0.25<p<0.5 



0.025<p<0.05 



l).05<p<0.1 



().025<p<0.05 



95 



\lt\ Dkl Ni' v*"Xw :tix;< i5"S»..V\ 



Triplets, positions 234 (63 experimental triplets. 7408 control triplets to 
determine expectation ) 



10 



All triplets 

Triplet 

YNR 

RNY 

RNR 

YNY 



Hxpected 
15.% 
15.97 
14.98 
1 6.07 



Observed 



14 
19 



(hisquare 

7.54 

5.12 

0.064 

0.534 

13.25 



probability 



0.001<p<0.005 



Specific triplets, positions 234 
YNR 16 
Not YNR 47 



38 



5.06 

1.7 

6.78 



0.005<p<0.01 



RNY 
Not RNY 



16 
47 



s 

59 



7.56 
3.06 
10.62 



0.()01<p<0.0()5 



YTR 
not YTR 



3.93 
59.07 



10 



9.38 

0.623 

10.0 



0.001<p<().0()5 



Pairing between position 2 and 4 ((.INC. CN(i. ANT. TNA) 
Paired 16.95 16 0.053 

Not paired 46.05 47 0.0196 

0.073 



0.75- p- 0.9 
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Preference for this site was statistically significant (p- 0.005). and preference lor each of its 
parts was also significant (p* 0.05). However, the preference is not particularly strong, in 
that representation of the site w as only 2.5-fold more frequent in insertion sites than 
5 expected from the composition of the plasmid; and 53 out of 63 sites do not fit the 
consensus. Kaeh preferred position contributes independently to the overall preference, 
since multiplying together the overrepresentation of each position yields the 
overrrepresentation of the site as a w hole ( fable 9). 

10 Table 9: overrepresentation of preferred bases in Tn7 insertion sites 



Position preference expected observed Fold overrepresentation 

(Obs/Exp) 

2 Y 32.1 44 1.37 

3 1 17.6 25 1.42 
15 4 R 30.9 39 1.26 

product ((0/K)2 x (() H)3 x (()/L:)4) 2.46 

triplet YTR 3.93 10 2.54 



We conclude that insertion mediated by TnsABCi:? is extremely random, w ith only a slight 
20 preference for sites of the form NYTRN (SFQ ID NO: 15). 

I. Recipes. 

1. 100X DNA buffer per liter 
25 Iris Rase 121.1 g 

Dissolve in 700 ml 

4 MI ICl -90ml 
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Bring pi I to 7.4 

Na : HI)TA 37.2 
NaCl 29.22 g 

5 

Make up to -950 nil 
adjust pH 
Make up to 1 I. 
Aliquot, autoclave 

10 

2. 1 X DNA buffer 

100 xDNA buffer 1 ml 

dH 2 0. sterile 100 ml 

3. 10 X l'i/L (Fill-in. ligation) buffer 

15 10 X ligase buffer 1500 ul 

New England Biol.abs. 32 To/.er Road. Beverly. Massachusetts. 01915 

lOOmMdATP 3.75 ul 

New Hngland Biol.abs. 32 To/.er Road. Beverly. Massachusetts. 01915 

lOOmMdCTP 3.75 ul 

20 New England Biol.abs. 32 To/.er Road. Beverly. Massachusetts. 01915 

lOOmMdGTP 3.75 ul 

New England Biol .abs. 32 Tozer Road. Beverly. Massachusetts. 01915 

100 niM dTTP 3.75 ul 

New England Biol.abs. 32 To/.er Road. Beverly. Massachusetts. 01915 

4. it: 

1 M I ris pll 8.0 1 ml 

0.5 M ED I A pll 8.0 0.2 ml 

dlEO to 100 ml 
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Filter sterilise 



Example 5 



A convenient method for stopping a transposon insertion reaction 

5 In order to use DNA molecules with transposon insertions, they must be recovered in vivo. 
It is most convenient to be able to do this without the labor and losses associated with 
extraction with organic solvents and alcohol precipitation. Prior art has suggested, however, 
that transposition reaction products formed during in vitro insertion experiments are DNA: 
protein complexes that are extremely stable: evidence suggests that a chaperone-like 
10 activity is required for disruption of these products. Accordingly, organic extraction w as 
deemed required for satisfactory disruption of the complexes. 

This example demonstrates that heat inactivation at 75°C is adequate for disrupting these 
complexes or at least for putting them into a form that can be introduced into the cell by 
15 chemical transformation. 



TnsC !27 lOOug/ml in 10° o glycerol 

Keep stored at -7()"(\ Sufficient protein for 10 reactions is provided. At the time of use. 
keep fro/en on dry ice until ready to add to the reaction, and keep on dry ice until 



Section A. MATERIALS 



Al ) PROTLINS 



TnsA 30 ug/ml in 10% glycerol 



20 



TnsB 20 Uii/ml in 25% elveerol 



returned to the freezer. 



A2) BITTLR COXSTH ILNTS 



HFPHS 



0.25 M pll 8.1 



Tris[Cl] 



0.25 M pi I 7.6 
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BSA lOmgml 

tRNA 50 ug ml 

DTI 1 M 

ATP 100 mM 

5 MgAcetate 375 mM 



A3) TRANSPOSON DONOR P1.ASM1D 

100 ug/ml 

10 This is as described for Example 3. 

A4) TARGET PLASMID 

pEITMUS28 400 ug/ml 

is A5)OTHHR 

Millicue water 
Heat block. 30°C 
1 .5 ml microtubes. 

20 A6) IOR STOPPING THE REAC HON 

when using chemically competent cells 

Water bath or heat block. 75°C. 
Water bath or heat block. 65°C 
25 Distilled phenol equilibrated with I E 

Chloroform equilibrated with IT 
EtOI I for precipitation 
NaCl 3 M 

Water or 1 mM I ris pi I <S 
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CI IEMICAEEY COMPETENT TRANSFORMABLE CELLS: 



In this example, we show the use of 

Chemically competent ER 1821 (2 x K) 7 transtbrmants |Ag of I.ITMl 'S 
Chemically competent ER2502 (6 x l() h trans formants ug of LITMl 'S 



prepared as in Example 3 



10 A8) MEDIA 



Rich Broth and Rich Agar (Kan, Amp) prepared as in Example 3. 



Section B. Tn7 /'// vitro transposition reaction protocol 

15 

1 . Experiment 1 . Four stop treatments 

Reactions were carried out as in Example 3. using quadruplicate samples for each of four 
treatments. At step 12, one of these treatments was substituted. For transformation. ER2502 
was used. 

20 

Treatment 1 : No treatment. 

Treatment 2: I leat treatment at 65°C for 20 min 

Treatment 3: Heat treatment at 75°C for 10 min. 

Treatment 4: Phenol extraction once, chloroform extraction once, ethanol precipitation 
25 once, resuspension in original volume of I'E. 

The results of this experiment are given in Table c ) below and illustrated in Figure 14 
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Tabic 10: Translbrmants obtained per 1 50th volume of transposition reaction 
Replicate 

::2 *3 ~4 

nothing 1 0 0 0 

65C 20* 1 0 0 0 

75C 10' 32 24 22 10 

phenol-pptn 30 23 20 17 



2. Experiment 2. Three stop treatments 

Reactions were carried out as in Example 3. using duplicate samples for each of three stop 
treatments, for two aliquots of TnsB, and for three volumes of TnsB. At step 12. one of the 
stop treatments was substituted. For transformation. HR1821 was used. 



15 Treatment 1 : Heat treatment at 75°C for 10 min. 

Treatment 2: Hthanol precipitation only, resuspension in original volume of TH 
Treatment 3: Heat treatment at 65°C for 20 min 



The results of this experiment are given in fable 9 below and illustrated in Fig. 15 
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Table c >. Transformants obtained per 1 50th volume of transposition reaction, three stop 
treatments. 



s 




InsB 


75C 10 min 


HtOH pptd 


65C 20 ir 




Aliquot 


Volume 


*i 1 
n J 




** 1 
" 1 










1 


1 


158 


8 


13 




15 


10 




1 


1.5 


186 


0 


16 


0 


J 


0 


10 


1 


2 


178 


170 


13 


13 


30 


16 




I 


-i 


454 


366 


47 


21 


11 


8 




2 


1 


324 


140 


21 


3 


9 


-> 






1.5 


506 


462 


58 


44 


25 


25 




2 




1220 


1102 


88 


37 


54 


18 


15 




-> 


1802 


1 690 


129 


126 


37 


14 



These two experiments demonstrate that heat treatment at 75°C for 10 min is an adequate 
method of stopping the transposition reaction and gives as many transformants as treatment 
with phenol, chloroform and ethanol precipitation; whereas no treatment, ethanol 
precipitation alone, and heat treatment at 65°C for 20 min is inadequate, giving no 
20 transformants or a greatly reduced number of transformants. 
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Example 6 

Storing three components of Tn7 transposase together 

Convenient routine use of in vitro transposition as a method in molecular biology 
5 would be facilitated if the protein components of the reaction could be stored in a single 
tube. In this way. variability in volume measurement from one experiment to another 
would be minimized, time and labor would be saved, and reproducibility enhanced. The 

TnsABC]27 transposition reaction described in the foregoing examples involves the 

addition of three different protein components. 
10 This example demonstrates that these three protein components of the reaction can 

be mixed and stored together without interfering w ith the efficiency of the transposition 
reaction. 

Section A. MATERIALS 
1 5 Al) INDIVIDUAL PRO THINS 

TnsA 30 ng/ml in 10% glycerol 
TnsB 20 ng/ml in 50° b glycerol 
lnsCi: 7 100 ng/ml in 50° <> glycerol 

20 Keep stored at -70°C. 

A2) MIXED PROTHINS. COMPRISING 
TnsA 7.36 ng nil 
TnsB 1 1 .3 ii g ml 
25 TnsCi:? 18.9 ug'ml 

in 40° o glycerol 

A2a) Keep stored at -70 C. or 
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A2b) Keep stored at -20 C 

A3K)THhR COMPONENTS 

These are as in example 1. parts A and B; including chemically competent ER2502 (6 \ 
5 K)' 1 translbrmants/ng ofLITMl [ S) prepared as in example 1 . 

Section B. Tn7 IX VITRO TRANSPOSITION REACTION PROTOCOL 
Bl. Reaction volume = 100 yd 

B2. Experimental variations (2 experiments are shown, reactions were carried out in 
10 quadruplicate). 

Tube 1 Proteins of Al added individually at step 6 below in a total volume of 5.3 \x\ 
Tube 2 Mixture of A2a added together at step 6 below in a total volume of 5.3 [i\ 
Tube 3 Mixture of A2b added together at step 6 below in a total volume of 5.3 jlx 1 
(Experiment 2 only) 
15 B3. Make up a mix as in Example 1. section C 

B4. Dispense mix of step 3 as in Example 1. section C 

B5. Add target ON A as in Example 1, section C. In this example, this is pEITMUS28. 1 pi 
B6. Add to each tube 

20 l ube 1 Tube 2 Tube 3 



TnsA 



TnsB 



1.3 |il(40ng) 
^il(20ng) 



TnsC]27 



1 ul(lOOng) 



TnsABC|27 



0 



5.3 Lil (30ng A. 



5.3 ^il(39ng A. 



5 l )» nu B 



59.9 ng B 



100.2 ng C)27) 



100.2 ng C127) 
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B7. Add lul donor DNA (0.1 ug pMCB40) as in example 1C 
BS. Incubate 10 minutes at 30°C (assembly reaction) as in example 1C 
\W. Add 4 nl MgAc (375mM) to each tube as in example 1C 
BIO. Incubate 1 hour 30°C (transposition reaction) as in example 1C 
5 Bl 1. Heat Inactivate 75°C 10 minutes 

B12. Transform using chemically competent cells, as in example 1. 

In this example, selective medium was RB Kan (20 |ig'ml) Amp ( 100 |ig ml). 

Competent cells were KR2502. chemically competent (Example 1. Section 1)1 ). 

10 C. Transformation result: 

Experiment 1: Proteins were stored individually at -70°C or as a mixture at -70°C 
(A2a). In this experiment, the proteins in both treatments had suffered the same 
number of freeze- thaw cycles. 10 |il of each 100 \x\ reaction was transformed, and 100 
15 \x\ of the 500 (il outgrowth culture was plated. 

Table 10: Transformants obtained per 1 50th volume of transposition reaction, 
transposition proteins added as a mixture or individually. Result is displayed in Figure 16. 

Replicate 

Storage ~1 -2 ~3 &4 Average avg per 

reaction 

Individually 27 62 59 41 47 2350 

As a mixture 47 68 60 23 4 C ) 2450 

20 

I xperiment 2: Proteins were stored individually at -70°C\ as a mixture at -7()°C (A2a 
material) or as a mixture at -20°C (A2b material). In this experiment, the proteins stored 
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individually had suffered more free/e-thaw cycles than those stored together. 10 ul of each 
100 ul reaction w as transformed, and 100 ul of the 500 ul outgrow th culture was plated. 



Table 1 1: Transformants obtained per 1 5 
5 transposition proteins added as a mixture 
-70°C. Result is displayed in figure 1 7. 



l volume of transposition reaction, 
individually following storage at -20°C or 



Storage 



Individually 

As a mixture. 
-70°C(A2a) 

As a mixture, 
-20°C. (A2b) 



#1 



167 



179 



Replicate 

#2 #3 



38 
173 

125 



#4 



17 

117 218 

219 199 



average 



168 



180 



avg per 
reaction 

1100 

8400 

9000 



These two experiments demonstrate that the three I ns proteins can be stored together. The 
10 difference in experiment 2 between individual storage and storage together may be 
attributed to the number of free/e-thaw cvcles. 
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